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METHOD AND APPARATUS FOR DYNAMIC DATA FLOW CONTROL 

RELATED APPLICATIONS 

5 This application claims the benefit of U.S. Provisional 

Application Serial No. 60/180,587, filed February 7, 2000 by 
Keith A. Lowery, et al . , attorney's docket number 066241.0105, 
and entitled, "Method and Apparatus for Dynamic Data Flow 
Control " . 

0 

TECHNICAL FIELD OF THE INVENTION 

This invention relates in general to client-server 

systems and, more particularly, to a method and apparatus for 
dynamic data flow control. 
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BACKGROUND OF THE INVENTION 

As computers have grown increasingly important in today's 
society, the importance of the Internet has also increased. 
As increased numbers of users access the Internet, the need 
for a web server to handle large numbers of data requests has 
also increased. In addition, as increasing numbers of 

requests are handled by the Internet, the delay experienced by 

the user between generating the request and receiving a 

response is also increasing. 

Some traditional solutions to handling large numbers of 
data requests and decreasing the delay at the web server have 
involved increasing the processing capability of the web 

server and distributing the processing of the requests across 
groups of web servers. Other traditional solutions have 
involved increasing the bandwidth of the network connection 
between the Internet and the web server or servers. However, 
traditional solutions have often failed as the number of 
requests continue to increase because of the expense 

associated with maintaining large numbers of web servers and 
high speed connections to the Internet. 

Electronic commerce often involves large amounts of 
different types of data. Similar to traditional brick and 
mortar stores, different people have different values to the 
retailer. For example, customers who are purchasing items at 
a check-out counter typically receive faster service than 
customer who are simply browsing since the store knows that 
the purchasing customer will be spending money while the 
browsing customer who may not purchase anything. Also, 
regular customers are often known to the sales staff and 
receive faster and more personalized service. Traditional 

electronic commerce systems have often failed similar levels 



DAL01: 489003 .3 
066241 . 0105 



ATTORNEY'S DOCKET PATENT APPLICATION 

066241.0105 3 

of provide faster and more personalized service to different 
electronic commerce customers. 



O 
■»C 
iri 

ry 

m 
m 

is 

a- 

m 
a 
m 
a 

a 



DAL01:469003.3 

066241 . 0105 



ATTORNEY'S DOCKET 
066241 . 0105 



4 



PATENT APPLICATION 



SUMMARY OF THE INVENTION 

From the foregoing, it may be appreciated that a need has 
arisen for a method and apparatus for dynamic data flow 
control so as to provide increased efficiency for handling 
5 large numbers of requests by a web site. 

According to one embodiment of the present invention, a 

method for data processing is provided which comprises 
receiving a data request at a data center. The method further 
comprises assigning a priority to the request. Also, the 

10 method comprises queuing the request as a function of the 
priority associated with the request and retrieving the 
requested data from an origin server. 

According to another embodiment of the present invention, 
a system for dynamic flow control is provided which comprises 

15 a cache server operable to receive a request for content and 
retrieve content from an origin server in response thereto and 
a flow control server having an associated queue and coupled 
to the cache server . The flow control server is operable to 
determine assign a priority to the request. The flow control 

2 0 server is also operable to store the prioritized request in 

the queue as a function of the priority associated with the 
request and regulate the retrieval of content from the origin 
server by the cache server. 

The present invention provides a number of technical 
25 advantages. One such technical advantage is the capability to 
prioritize requests based on a request and an origin server to 
which the request is addressed for controlling the processing 
load at the origin server. Another such technical advantage 
is the capability to configure a data center for independent 

3 0 control of the processing load experienced by different origin 

servers. A further technical advantage is the capability to 

provide status information about the request to a user 
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associated with the request based on the delay in processing 
the request and other configurable criteria associated with 
the origin server. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

A better understanding of the present invention will be 
realized from the detailed description that follows, taken in 
conjunction with the accompanying drawings, in whi.ch: 

FIGURE 1 is a block diagram of a web content caching 
system; 

FIGURE 2 is an exemplary network illustrating network 

distance between a source node and a destination node; 

FIGURE 3 is block diagram illustrating details of the web 
content caching system of FIGURE 1; 

FIGURE 4 is a block diagram illustrating details of a 
flow control server usable with the present invention; 

FIGURE 5 is a block diagram illustrating details of a 

cache server system usable with the present invention; and 

FIGURE 6 is a flowchart illustrating the data caching and 
dynamic flow control method of the web content caching system 
of FIGURE 1. 
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DETAILED DESCRIPTION OF THE INVENTION 

FIGURE 1 is a block diagram of a web content caching 

system 10 . Caching system 10 comprises a plurality of clients 
11 , a browser 12 executing on client 11, a network 13, one or 
5 more data centers 14, a data center manager 16, an origin 
server 18, a data source 20 and a Domain Name Service (DNS) 
server 23. Caching system 10 provides increased World Wide 

Web ("web") performance to clients 11. Performance is 

increased by caching data at the data centers 14 so that the 

10 data is located closer, in terms of network distance, to the 
clients 11, by decreasing the volume of data requests handled 
by origin server 18 and by decreasing the volume of dynamic 
data to be generated by origin server 18. It should be 
understood that while the disclosed embodiment is discussed in 

15 terms of the Internet Protocol (IP) , any suitable network 
protocol could be used, such as Asynchronous Transfer Mode 

(ATM) , frame relay and Fiber Distributed Data Interface 
(FDDI ) . 

Each client 11 comprises any suitable general purpose or 
20 specialized computer system operable to support execution of 
browser 12 . Clients 11 respectively comprise a processor 15 
and a computer readable memory 17 coupled to processor 15. 
Each client 11 is coupled to network 13. Browser 12 executes 

on client 11 and comprises any suitable Hypertext Transport 
25 Protocol (HTTP) client. In the disclosed embodiment, browser 
12 comprises a web browser such as Internet Explorer® by 
Microsoft Corp. of Redmond, Washington or Netscape 
Communicator by Netscape Communications Corp. Browser 12 is 

held in memory 17 and executed by processor 15. Browser 12 
3 0 transmits and receives data over network 13 . Each browser 12 
is operable to generate one or more requests 22 . Each request 
22 is generated by a particular browser 12 in response to a 
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desire for information by a user (not shown) associated with 
that particular browser 12. Each request 22 comprises a 
request for one item of content from origin server 18. The 
requested item of content may further comprises multiple 
5 elements of content, for example, a web page with multiple 
graphic elements therein, but the request is directed to 
retrieving the single content item while the remaining 

elements of content in the main content item are retrieved as 
a function of the single content item. Content comprises 

10 static and dynamic audio data, video data, text data, 
multimedia data, Hypertext Markup Language (HTML) data, binary 
data and any other suitable types of data capable of being 
used by browser 12. In the disclosed embodiment, requests 22 
are HTTP requests for HTML data, such as web pages. An item 

15 of content ("content item") indicates a particular element of 
content, such as a particular web page, while content refers 
generally to data available to be retrieved. 

Content comprises static information and dynamic 
information. Static information typically does not change 

2 0 once the content has been created or does not change except 
slowly over time. Stated another way, static information 
comprises content that, when created, is expected to remain 
the same for an indeterminate amount of time. Static 
information may comprise, for example, text files and pre- 

25 existing Hypertext Markup Language (HTML) web pages. For 
example, a restaurant menu is the same regardless of the user 
who is accessing the menu because the menu is the same for all 
people. However, as chefs at the restaurant change over time, 
the menu may also change. In contrast, dynamic information 

30 comprises content that is expected and designed to change. 

The dynamic content may change based on the data and criteria 
used for generating the dynamic content, such as a search 
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result page. For example , a search for "mountain lions" will 
dynamically generate a search result page listing items 
related to "mountain lions." The dynamic content may also 
change based on information associated with the page, for 
5 example , a page in an online product catalog may change in 

response to pricing and availability changes, and dynamic 
content may further change for other suitable reasons. 
Dynamic information may comprise, for example, the output of 
Common Gateway Interface (CGI) scripts, search engine results, 

10 the output of Structured Query Language (SQL) searches and the 
SQL search itself, JavaScript, the output of JavaScript 
scripts, ActiveX controls and the output of ActiveX controls, 
Active Server Pages, applets and any other suitable dynamic 
content generation system. Often, the dynamic information 

15 will be expressed as a single HTML web page, but the 

information within the HTML web page has been generated 
dynamically based on some suitable criteria. For example, the 

result o£ a search using a search engine on the Internet 
returns different information based on the search terms 
2 0 provided by the user. The search results may also depend on 

one or more attributes associated with the search request, 

such as geographic location or the current date. Continuing 
the search engine example, when the user is searching for 
local businesses, the search results will vary not only on the 

25 search terms provided by the user, but also what "local" means 

with respect to the search request . For another example, a 
user searching for information about current events will want 
results that are tailored to the user's search terms, the 
user's location, and the current date. For yet another 
30 example, an online book retailer may provide price information 

and consumer reviews of books available from the online book 

retailer. The price information for a particular book may 
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change unexpectedly in response to a sale or a sudden interest 
in that particular book. Also, the web page listing the book 

and the consumer reviews changes in response to a consumer 

entering a new review of the book. One aspect of the system 
5 of the present invention allows for the efficient caching of 
static information that never changes, static information, 
such as the menu example, that changes infrequently, and some 
suitable types of dynamic information. 

Network 13 comprises any suitable data networking system 

10 for communicating data between computer systems. For example, 

network 13 may be the Internet, an asynchronous transfer mode 
("ATM") network, an Ethernet network, a Transmission Control 
Protocol/Internet Protocol (TCP/IP) network, an intranet or 
any other suitable computer networking technologies. For 
15 purposes of teaching the present invention, an exemplary 

embodiment will be described where network 13 comprises the 

Internet . 

Data centers 14 cache content for transmission to 

browsers 12 in response to requests 22. Data center 14 

2 0 responds to request 22 from browser 12 by providing the 
requested web page or other data to browser 12 using network 
13 . Data centers 14 are described in more detail in 
association with FIGURE 3. In the disclosed embodiment, data 
centers 14 cache content from origin server 18. Data centers 

25 14 provide current, cached content originally available from 
origin server 18 to a browser 12 in response to request 22 . 
In the disclosed embodiment, data centers 14 are shown as 
communicating with a single exemplary origin server 18, 
however, data centers 14 are operable to communicate with and 

30 provide service to a plurality of origin servers 18. 

Caching content comprises storing a copy of content 
available at or generated by origin server 18. Content is 
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initially generated at origin server 18 and a copy is stored 
("cached") at data centers 14. The copy is retrieved by a 
particular data center 14, such as data center 14A, in 
response to receiving a particular request 22 requesting 
5 particular content. If that particular content is not 

available at data center 14A, then that particular content is 

retrieved from origin server 18. The caching process is 
described in more detail in association with FIGURE 5. A copy 
of the content available from origin server 18 may be stored 

10 at one particular data center 14, a plurality o£ the data 
centers 14 or all of the data centers 14 depending on which 
data centers 14 receive requests 22 for that content. 

As discussed previously, some content may be dynamic in 
the sense that it may become inaccurate or superceded over 

15 time. To address this situation, content currently existing 
at the data centers 14 may be marked as expired in response to 
an expiration command 24 (discussed in detail in association 
with FIGURE 5) from data center manager 16. Content may 
alternatively be actively deleted in response to expiration 

20 command 24. Expired content is considered locally unavailable 
at data centers 14 by data centers 14 and is retrieved from 
origin server 18 in response to a new request 22 for the 
particular content. 

Caching content at data centers 14 decreases the number 

25 of requests 22 that must be serviced by origin server 18. In 
the disclosed embodiment, since data centers 14 will request 

content from origin server 18 only when the content is locally 

unavailable at data center 14, origin server 18 will be 
required to generate or provide a particular content item a 
30 maximum number of times. Typically, the maximum number of 
content generation occurrences equals the total number of data 
centers; however, the number of content generation occurrences 
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may exceed the total number of data centers. For example, if 

10 data centers are used, then the origin server will only 
have to generate a particular content item at most 10 times, 
once for each data center, because after the particular 
5 content item is generated for each data center 14, the data 

centers 14 will provide the particular content item to browser 
12 instead of origin server 18 providing the content item to 
browser 12. By spreading requests 22 over a plurality of data 
centers 14, response time is decreased because the processing 
10 load associated with servicing the requests is spread over 

multiple computers at multiple locations. Also, in the 

disclosed embodiment, data centers 14 are located closer, in 
terms of network distance, to clients 11. Network distance is 
described in more detail in association with FIGURE 2 . For 

15 example, data center 14A is closer to client 11A, data center 
14B is closer to client 11B, and data center 14C is closer to 
client 11C. However, any data center 14A-C may service 
requests from browser 12 on any client 11A-C. Locating data 
centers 14 closer to clients 11 decreases the response time 

20 for handling requests by decreasing the amount of time that 
the response and the request spends in transit over network 13 
from a particular data center 14 to a particular client 11. 

Data center manager 16 is coupled to data source 2 0 and 
data centers 14. Data center manager 16 operates to expire 

25 content at data centers 14. In the disclosed embodiment, data 
center manager 16 comprises a computer system executing 
software which is operable to expire content at data centers 
14 . Data center manager 16 expires content at data centers 14 
by sending data expiration commands 24 to data centers 14. 

3 0 Data center manager 16 may generate data expiration command 
24, for example, in response to a change in the content at 
data source 20. Specifically, data center manager 16 
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generates data expiration command 24 in response to a data 

change message 28 (discussed in more detail in association 

with FIGURE 5) from data source 20. Data center manager 16 
may also generate data expiration command 24 in response to 
5 the elapsing of a predetermined time period. For example, data 
center 16 may know that certain content changes at periodic 
intervals at data source 2 0 and independently generates a data 
expiration command 24 for that content. 

Data source 2 0 stores data used for generating content 

10 and content itself. Data source 20 comprises magnetic storage 
devices, optical storage devices, persistent storage devices, 
transient storage devices, and any other suitable storage 
device or technology for storing data. Data source 2 0 may be 
a portion of origin server 18 or may be physically separate 

15 therefrom. Data source 20 includes a trigger portion 21. 

Trigger portion 21 of data source 20 generates data change 
message 2 8 indicating changes in the content or portions of 
content available from origin server 18 and stored in data 

source 20. Trigger portion 21 may also indicate a change in 

2 0 the data based on the time and date. Data change message 2 8 is 

then communicated to data center manager 16. 

Origin server 18 is coupled to data centers 14 and data 
source 20. Origin server 18 comprises software executing on a 

computer for receiving and responding to requests 22. Origin 
25 server 18 may comprise a single computer executing software or 
may comprise a plurality of computers each executing software. 
In the disclosed embodiment, origin server 18 is associated 
with a single Internet domain; however, multiple origin 
servers 18 associated with multiple distinct Internet domains 

3 0 are also usable in connection with the present invention. 

Also, in the disclosed embodiment, origin server 18 is a 
hypertext transport protocol (HTTP) server which is also known 
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as a web server. Origin server 18 may additionally support 

other protocols such as the file transfer protocol (FTP) and 
the gopher protocol . Origin server 18 retrieves information 
from data source 2 0 in response to requests 22 forwarded from 
5 data centers 14. Origin server 18 is operable to retrieve 
static content, such as prewritten text files, images, and web 
pages, from data source 20 in response to requests 22. Origin 
server 18 is also operable to generate new, dynamic content, 
for example, by dynamically creating web pages based on 

10 content stored in data source 20 in response to requests 22. 
For example, origin server 18 may generate a new web page 
using a. common gateway interface (CGI) script, generate a new 
web page from the result of a structured query language (SQL) 
request and perform other suitable content generation 

15 functions. Origin server 18 may also be operable to generate 
executable software, such as applications and applets, in 

response to requests for data. For example, origin server 18 

may generate a Java applet in response to an appropriate 
request 22 . 

2 0 DNS server 2 3 comprises any suitable DNS server for 

translating machine and domain names into numeric Internet 
Protocol (IP) addresses. DNS server 23 may also comprise a 
single DNS server or multiple DNS servers; however, DNS server 
23 is discussed as if it were a single DNS server. In order 

25 for origin server 18 to be served by data centers 14, DNS 
server 2 3 resolves machine and domain names associated with 
origin server 18 to data centers 14 . Specifically, all or a 
portion of the domain names and machine names associated with 
origin server 18 are translated to IP addresses associated 

30 with data centers 14. Requests 22 directed to origin server 
18 will now be received by data centers 14 from browsers 12 
before requests 22 reach origin server 18. Specifically, 
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requests 22 directed to domain names and machine names 
associated with origin server 18 resolve to IP addresses data 
centers 14 . Any subset of the domain names and machine names 
in any suitable combination may be redirected to data centers 
5 14. The subset can be changed by the administrator associated 
with origin server 18 at any time so as to provide flexibility 
as to whether content at origin server 18 is cached. Data 
centers 14 can then control the flow of requests 22 between 
browsers 12 and origin server 18 . 
10 In operation, browsers 12 generate requests 22 for 

content from origin server 18 . One of the data centers 14 
receives request 22 . Data centers 14 receive request 22 even 
though request 22 is directed at origin server 18 because DNS 
resolution has directed request 22 to data center 14 . Stated 

15 another way, requests 22 addressed to origin server 18 are 

routed to data centers 14. For example, if browser 12 on 
client 11A generates request 22, then data center 14A receives 
request 22 before request 22 reaches origin server 18. 
Specifically, request 22 has an associated address portion 

20 indicating the destination web site for request 22. Since DNS 
server 23 resolves one of data centers 14 as the destination 
instead of origin server 18, data centers 14 can receive 
request 22 by resolving the address portion of request 22 to 
the address of the closest , in network distance , data center 

25 14 to browser 12 using DNS server 23. 

Continuing the above example, data center 14A then 
determines whether the requested content has been cached at 
data center 14 A. I f the requested content is cached at data 
center 14A, then request 22 is handled at data center 14A and 

30 the requested content is returned to browser 12, If the 

requested content is not available at data center 14A, request 
22 is forwarded to origin server 18 for handling. When origin 
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server 18 is required to handle request 22, the requested 

content will first be communicated to data center 14A for 

caching, if the requested content is to be cached, and then 
transmitted from data center 14A to browser 12 on client 11A. 
5 FIGURE 2 is an exemplary network illustrating network 

distance between a source node 50 and a destination node 52 in 
a packet switched network. Source node 5 0 comprises any 
suitable network device, such as a computer, operable to 
generate a data packet for communication to destination node 

10 52 . Destination node 52 comprises any suitable network device 
for receiving data packets from source node 50 , such as a 
computer. Between source and destination nodes 50 and 52 is a 
communication network 53 such as the Internet. Communications 
network 53 comprises a plurality of nodes 54, 56, 58, 60, and 

15 62. Each node 54, 56, 58, 60 and 62 comprises any suitable 
network device for receiving data packets and transmitting 
data packets, such as a computer or router. Each of the nodes 
50, 52, 54, 56, 58, 60 and 62 are coupled by communications 
links 64 . Each of the communications links 64 has a latency 

2 0 associated therewith. The latency represents the sum of the 

time a packet has spent waiting to be forwarded out of a 
particular node and the transit time over the link, for 
convenience, the latency is associated with a particular link, 
even though a majority of the latency time will typically be 

2 5 spent by the packet while waiting to be forwarded out of the 
node. The latency is indicated by a time in milliseconds. 
For example, communications link 64 between nodes 54 and 56 
has a latency of 45 milliseconds. 

Distance between the source and destination 50 and 52 may 

30 be determined in various ways. In the exemplary configuration 
of FIGURE 2, only hop count and latency between the source and 
destination 50 and 52 are considered, however, other measures 
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of network distance are also usable. The distance between the 
source and destination 50 and 52 may be determined by counting 

the number of nodes between the source node 50 and the 

destination node 52 ("hop count") . For example, the distance 
5 from source node 50 to destination 52 is three hops along the 
path defined by nodes 54 and 56 and is four hops along the 
path defined by nodes 58, 60 and 62. Each hop represents one 
communication link 64 that a packet traverses. Thus, from 

source node 50 to node 54 is one hop, from node 54 to 56 is 
10 another hop, and from node 56 to destination node 52 is 
another hop. 

The latency of communications links 64 may also be 
considered in determining network distance. For example , a 
packet traveling from source node 5 0 to node 54 will take five 
15 milliseconds to travel across the communications link 64 

between source node 50 and node 54. Similarly, a data packet 

traveling from source node 50 to node 58 will experience a ten 
millisecond delay as the packet crosses communications link 64 
between source node 50 and node 58. The total latency 

20 experienced by a data packet traveling from source node 50 to 

destination node 52 is the sum of the latencies of each 
communication link 64 traversed on the path from source node 
50 to destination node 52. Thus, the latency from source node 
50 to destination node 52 along the path defined by nodes 54 

25 and 56 is sixty milliseconds (5 milliseconds + 45 milliseconds 
+ 10 milliseconds) , The total latency between source node 50 
and destination node 52 along the path defined by nodes 58, 60 
and 62 is thirty milliseconds (10 milliseconds + 5 
milliseconds + 5 milliseconds + 10 milliseconds) . Thus, while 

30 the number of hops between source node 50 and destination node 
5 2 through nodes 54 and 5 6 includes fewer hops, the latency is 
much higher than along the path of nodes 58, 60 and 62, By 
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considering both the number of hops and the latency, the 
distance between the source node 50 and destination 52 may be 
determined and the faster and shorter route can be chosen. 
Also, the latency of any particular communications link 64 may 
5 not be a fixed value. For example , the link 64 between nodes 
54 and 56 is shown as 45 milliseconds which may result when a 
large amount of data is being communicated between nodes 54 
and 56. As the amount of network traffic between 54 and 56 
decreases, the latency may decrease as well. Thus, while in 
10 the exemplary network of FIGURE 2, the path across nodes 58, 
60 and 62 has a lower latency than the path along nodes 54 and 
56 , at some other time the path along nodes 54 and 56 may be 
more optimal than the path along nodes 58, 60 and 62. The 
determination of network distance discussed above is also 

15 applicable to determining which of a plurality of destination 
nodes has the shortest path from a single source node. 

FIGURE 3 is block diagram illustrating details of web 
caching system 10. Data center manager 16 comprises a crawler 
application 29. Crawler application 29 comprises a software 

2 0 application executed by data center manager 16 for assisting 

an administrator associated with origin server 18 in 
determining which content available from origin server 18 
should be cached at data centers 14, status information 
(described later in association with flow control server 34 
25 and in detail in association with FIGURE 4) associated with 
particular content items , priority information (described in 
later in association with flow control server 34 and in detail 
in association with FIGURE 4) associated with requests, such 
as requests 22 and other suitable configuration information 

3 0 associated with controlling the caching of content from origin 

server 18. Data center 14 comprises a web server 30, a cache 
server 32, a flow control server 34 and communication links 
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3 6A and 3 6B. Web server 3 0 is coupled to communications 
network 13 and is operable to receive request 22 and generate 
responses thereto. Web server 3 0 is further coupled to cache 
server 32 over communication link 36A. Cache server 32 is 
5 coupled to flow control server 34 over communication link 36B 
and to origin server 18 over network 13 . Cache server 3 2 is 
operable to store content requested from web server 3 0 and 
provide requested content to web server 30. Flow control 
server 34 is operable to control cache server 32 by 
10 determining when cache server 32 may request content from 
origin server 18. 

Web server 3 0 comprises any suitable server application 
operable to retrieve data in response to requests 22 . In the 
disclosed embodiment; web server 30 comprises an HTTP server 

15 ("web server") , such as the Internet Information Server® by 
Microsoft Corp. of Redmond, Washington and the Apache web 
server by the Apache Software Foundation of Forest Hill, 
Maryland. Web server 3 0 receives request 22 and examines 
request 2 2 to determine what content is requested by request 

20 22. Web server 30 then requests the desired content from 

cache server 32. Alternatively, web server 30 may only 

determine whether the requesting user is allowed access to the 

requested content, decrypt Secure Socket Layer (SSL) encrypted 

content requests and then pass the request over to cache 
25 server 32 for handling. After receiving the requested content 

from cache server 32, web server 3 0 provides the requested 

content to browser 12 . 

One example of a cache server that could be used as cache 

server 32 is shown in FIGURE 5 . Cache server 32 comprises a 
3 0 software program executing on a data processing system for 

storing and retrieving content in response to information 

requests from web browser 12. For example, cache server 32 
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may comprise the Squid caching application. Specifically, 
cache server 32 operates to store content, such as web pages , 
previously requested by browser 12 . 

In operation, cache server" 32 provides tra.nspa.rent proxy, 
5 content caching, content synchronization, flow control and 
status response functionality. Transparent proxy 

functionality comprises passing content and requests through 
cache server 3 2 and not altering the content or the request as 
seen by browser 12, origin server 18 or web server 30. Both 
10 browser 12 and origin servers 18 are typically unaware of the 

□ 

proxy server. Content caching comprises storing, at cache 

i..n 

;,g server 32, cacheable content that is served to browsers 12 by 

the proxy server functionality of cache server 32. The cached 
ffl content may later be served to additional browsers 12 without 

ii; 15 going back to origin server 18. Cacheable content may be 
!*;: indicated by a set of headers associated with the content from 

.11 f=S, 

^ origin server 18 and by the configuration of cache server 32. 

□ Cacheable content also comprises HTTP POST requests. One 

element of the configuration of cache server 32 may specify 
2 0 that caching instructions in the headers of the content may be 
ignored. Content syncronization comprises preloading and 

expiring content stored at cache server 32 . Flow control 
allows for each incoming content request to be subjected to a 
scoring mechanism that indicates the priority of the content 
2 5 request. 

In the disclosed embodiment, a substantial portion of the 

content on cache server 32 will have been originally retrieved 
from origin server 18. Substantially little of the 

information available on cache server 32 is independently 
30 generated by cache server 32 . Alternatively, cache server 32 
may generate all or any portion of the content available on 
cache server 32. Cache server 32 and web server 30 may 



DAL01 : 4 89003 .3 
066241.0105 



ATTORNEY'S DOCKET 

066241 . 0105 



21 



PATENT APPLICATION 



optionally have their functionality combined into a single 
element such that cache server 32 provides suitable 
functionality to allow cache server 32 and flow control server 

34 to handle requests 22 as described previously without the 

5 use of a discrete web server 30. 

In operation, crawler 2 9 examines all or a portion of the 
web pages available at origin server 18. Crawler 29 generally 
communicates directly with web server 18 using the HTTP and 
HTTP Secure (HTTPS) protocols. Crawler 29 then hierarchically 

10 organizes the web pages for an administrator associated with 
origin server 18 (not shown) so that the administrator may 
determine which web pages should be cached at data centers 14 . 
Crawler 2 9 also allows the administrator to configure status 
information and alternate content associated with particular 

15 content items, priority information associated with requests 
22 and other suitable configuration information associated 
with controlling the caching of content from origin server 18. 
The configuration information provided by the administrator 
using crawler 2 9 is used to configure and control data centers 

20 14. Crawler 29 may alternatively determine automatically 

which web pages available at origin server 18 should be cached 

at data centers 14. When crawler 2 9 requires access to SSL 
secured web pages at origin server 18, crawler 2 9 communicates 
with origin server 18 through a crawler web server (not shown) 
25 similar to web server 32. The crawler web server is used to 

provide an SSL secured link between origin server 18 and 

crawler 29. The crawler web server appropriately encrypts and 
decrypts communications with origin server 18 while 
communicating using unencrypted communications with crawler 
30 29 . 

Cache server 32 receives a request for information, such 

as request 22, from web server 30. Cache server 32 then 
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examines request 22 and determines whether the requested 
content is locally available and current on cache server 32. 
If the requested content is locally available and current on 
cache server 32, cache server 32 communicates the requested 
5 content to web server 30 over link 36A for communication back 
to browser 12 . If the requested content is not available on 
cache server 32, cache server 32 communicates request 22 to 
origin server 18 under the control of flow control server 34 
as described below. After requesting content, cache server 32 

10 receives the requested content from origin server 18. Cache 
server 32 then determines whether the retrieved content is 
cacheable. If the content is cacheable, then cache server 32 
stores a copy of the content and then communicates the content 

to web server 30 for communication to browser 12. If the 
15 content is not cacheable, then cache server 32 does not store 
a copy of the information and passes the information to web 
server 30 for communication to browser 12. Cache server 32 
determines whether content is cacheable based on configurable 
cache criteria (described in more detail in association with 
2 0 FIGURE 5) . Cache server 3 2 is further operable to preload one 
or more content items. For example, cache server 32 may be 
configured by an administrator associated with origin server 
18 to preload content items that the administrator knows will 
be accessed in order to further decrease the load at origin 

2 5 server 18 . 

Cache sever 32 determines whether content is locally 
available and current based on at least two criteria. The 
first criteria is whether the requested content is present at 
cache server 32. If the requested content is present at cache 

3 0 server 32, the second criteria is whether the requested 

content is current or expired. Each content item stored at 
cache server 3 2 may have a respective expiration time 
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associated therewith. Since some content may never expire, or 
may not have an expiration time known ahead of time, any 
particular content item may or may not currently have a 

respective expiration time associated therewith. The 
5 expiration time describes when the content item expires and is 
no longer valid. The expiration time is updateable and may be 
initially set as a function of the content item such as the 
expiration time used by the HTTP protocol. In the disclosed 
embodiment, the expiration time comprises a time and date 
10 indicating when the associated content item expires, such as 

with an offset indicating an amount of time that must elapse 

before the content item is expired . The expiration time may 
alternatively comprise any suitable representation for 
representing when a content item expires. The expiration time 

15 may be accelerated, decelerated or otherwise changed by an 
expiration command 24 . 

Expiration command 2 4 is generated by data center manager 
16 and indicates a change in the expiration time associated 
with one or more items of content on cache server 32. In the 

20 disclosed embodiment, expiration command 24 comprises an 
Internet Cache Synchronization Protocol (ICSP) terse command 
(described in detail in association with FIGURE 4) . 

Expiration command 24 is generated by data center manager 
16 in response to data change message 28 from data source 20. 

25 Data change message 28 is generated by trigger 21 associated 
with data source 20. Trigger 21 detects a change in one or 

more content items available from data source 20. Trigger 21 

then generates data change message 2 8 indicating which content 
items have changed and communicates data change message 2 8 to 
30 data center manager 16. Data center manager 16 then generates 
one or more expiration commands 24 to expire the cached copies 

of the changed content items at one or more data centers 14. 
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For example, if an airline flight schedule has changed at data 

source 20, then the airline would not want customers receiving 
invalid flight schedules from data centers 14. Therefore, 
trigger 21 detects the change and communicates the change to 
5 data center manager 16. Data center manager 16 then informs 
data centers 14 of the change so that when a request for the 

flight schedule arrives at one or more data centers 14, data 
centers 14 will know to request a new copy of the flight 
schedule from origin server 18 associated with data source 20. 
10 In the disclosed embodiment, data centers 14 and specifically 
cache server 3 2 , do not request a. new copy of expired content 

except in response to a request 22 for that content. 
Alternatively, cache server 32 could actively request a new 

copy of the changed content in response to receiving 
15 expiration command 24. Note that since cache server 32 does 
not request a new copy of changed content in response to 
expiration command 24, it is possible and expected that cache 
server 32 may receive multiple expiration commands 24 expiring 
the same content. When multiple expiration commands 24 are 

2 0 received by cache server 32 which each expire the same or 

overlapping content , cache server 32 may repeatedly mark the 

expired content as expired, but will retrieve new content only 
in response to an appropriate request 2 2 . By not updating the 
expired content after each expiration command 24, cache server 

25 32 decreases the network and processing load on origin server 

18 by not requesting content that browsers 12 may not request 
for a long period of time or before the next expiration 
command 24 expiring that content. This provides a significant 
decrease in the processing and network load experienced by 

3 0 origin server 18 by avoiding retrieval of unused and yet to be 

requested content by cache server 32 . 
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One example of a flow control server that could fc>e used 

as flow control server 34 is described in association with 
FIGURE 4 . Flow control server 34 is operable to control and 

regulate the passing of information requests, such as requests 
5 22, from cache server 32 to origin server 18. Flow control 
server 34 operates to monitor the processing and network load 
on origin server 18. Cache server 32 requests permission from 
flow control server 34 to request locally unavailable content 
from origin server 18. Flow control server 34 determines 
10 whether cache server 32 may request content from origin server 

18 based on the processing and network load on origin server 

18 . 

In addition to controlling whether cache server 32 can 
request locally unavailable content from origin server 18, 

15 flow control server 34 is operable to determine the order in 
which requests for locally unavailable content are 
communicated from cache server 32 to origin server 18. Flow 
control server 34 may order the requests for locally 
unavailable content as a function of various load criteria 

20 (described in detail in association with FIGURE 6) configured 
by an administrator associated with origin server 18, such as 
a current state associated with each request. For example, 
cache server 32 requests permission to forward two requests to 
origin server 18, the first request is a search request for 

2 5 product information by a first user and the second request is 

a request from a second user to buy a product from an 
electronic commerce site. Flow control server 34 can 

prioritize and order the first and second requests so that the 
second request is handled first, because the administrator has 

3 0 indicated that buy requests should get immediate handling so 

that the user does not get impatient and abort the order while 
search requests which are less important are occupying the 
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server. Additionally, flow control server 34 could order the 

first and second requests based on information associated with 
the first and second users, such as the buying history or 
amount of money previously spent by the first and second 
5 users. Flow control server 34 evaluates the processing and 
network load on origin server 18 according to user-specified 
criteria to determine if cache server 3 2 may request content 
from origin server 18. For example, the user may specify that 
origin server 18 should not exceed 90% of its maximum 

10 processing capability. For another example, the maximum 
number of simultaneous connections may be limited to some 
number, such as 5,000. For yet another example, flow control 
server 34 may periodically interrogate origin server 18 to 
determine the current load and the criteria may specify the 

15 maximum allowable load on origin server 18 as a function of 
the response of origin server 18 to the interrogation, such as 
how long origin server 18 take to return a requested web page. 

When flow control server 34 does not allow cache server 
32 to request content from origin server 18 , cache server 32 

2 0 may be instructed to return status information to web server 
30. Status information comprises information alerting browser 

12 that processing of request 22 is in process, but will be 
delayed. For example, status information may describe the 
length of time before request 22 will be processed. The 

25 status information may also list the position of request 22 in 
a queue of requests to be processed. The status information 
may also present an advertisement to a user of browser 12. 
For example, the advertisement may be generated based on 
information previously gathered about the user associated with 

30 browser 12. Generally, the status information may 

configurably provide any suitable information to the user 
associated with browser 12 as determined by the administrator 
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associated with origin server 18 and configured into the load 
criteria for origin server 18. In the disclosed embodiment , 
the status information comprises an HTML web page generated 
based on load criteria determined by the administrator 
5 associated with origin server 18. 

Data links 3 6A and 3 6B comprise the data communication 
portion of data center 14 and support communication between 
web server 30, cache server 32 and control server 34. Data 
links 36A-B may comprise any suitable wireless and wireline 
10 computer networks. Data links 36A-B may also comprise a 
computer bus and any other suitable combination of hardware 
and software operable to support communication between web 
server 3 0 , cache server 32 and flow control server 34 . When 
two or more of web server 30, cache server 3 2 , and flow 

15 control 34 are executing on a single physical computer, then 

data links 36A-B comprise data communication systems for 
communicating between programs and within programs executing 
on a single physical device. For example, if web server 30 
and cache server 32 are both being executed by the same 
2 0 physical computer, then data link 3 6A may comprise a 
peripheral component interface (PCI) bus for transferring data 
between web server 30 and cache server 32. For another 
example, when web server 3 0 and cache server 32 are executing 
on separate physical devices, then data link 3 6A may comprise 

2 5 an Ethernet network. Data links 3 6A-B may represent two 

distinct data communication systems, portions of a single data 
communications system, or any suitable combination thereof. 

For example, data link 36A may comprise an Ethernet network 

while data link 3 6B represents a computer bus. 

3 0 FIGURE 4 is a block diagram illustrating details of a 

flow control server 134. FIGURE 4 illustrates a general flow 
control server for increased clarity and the application of 
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the general flow control server to flow control server 34 will 
be shown. Flow control server 134 operates to regulate and 
control the flow of data requests from a network device which 
desires data and a network device having the data. Flow 
5 control server 134 comprises one or more queues 100 , one or 
more sets of priority criteria 102, one or more load monitors 
104 and one or more sets of load criteria 106. Flow control 
server 134 is coupled to a flow controlled device 110 over a 
communications link 112 and to a protected resource 12 0 over 

10 communications link 122. Flow controlled device 110 and 

protected resource 12 0 communicate over communications link 
130. 

Queue 10 0 comprises a computer readable memory of any 
suitable type, such as random access memory, magnetic memory 
15 and optical memory. Queue 100 stores a plurality of data 
requests 13 6. 

Priority criteria 102 comprises information provided by 
an administrator (not shown) associated with protected 
resource 12 0 to server 134 for determining the relative 
2 0 importance of requests for data from device 110 to resource 
120. Priority criteria 102 may specify that specific types 
and groups of requests be given specific priority levels while 

other requests be given a default priority. For example, a 
product buy request at an e- commerce site or an add to 

2 5 shopping cart request may be given a higher priority, while a 

request for a product catalog may be given a lower priority or 
a default priority. For another example, a higher priority 
may be given to requests originating from a particular IP 
address because empirical data suggests that that IP address 

3 0 has a high probability of buying. For yet another example, 

priority criteria 102 may track purchasing patterns at an e- 
commerce site and determine that certain products have a 
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higher probability of actually being purchased after being 



added to a shopping cart. Also, priority criteria 102 may be 



suddenly becomes a popular item, priority criteria 102 can 
5 automatically adjust to the increased popularity of the item 
and assign a higher priority to requests related to that item. 

Load monitor 104 comprises an application program for 

monitoring and determining the current processing load 
experienced by resource 120. Load criteria 106 comprises 

10 information provided by the administrator associated with 
resource 12 0 to server 134 for determining the maximum load 
server 134 should allow resource 120 to experience. For 
example, load criteria 106 may specify that resource 12 0 
should experience no more than 10,000 simultaneous 

15 connections. For another example, load criteria 106 may 
specify that resource 120 not be loaded beyond 90% of the 

resource's processing capability. Also, load criteria 106 may 

specify the maximum allowable load as a function of the 
response of server 12 0 to interrogation by server 134. 

20 Communications links 112, 122 and 130 may comprise any 

suitable data communication system operable to support 
communication between device 110, server 134 and resource 120, 
In the disclosed embodiment, links 122 and 130 each comprise a 
portion of the Internet and link 112 comprises an Ethernet 

25 link. 

In operation, flow controlled device 110 desires to 
retrieve data from protected resource 120. Before retrieving 
data from protected resource 120, flow controlled device 110 
requests permission to retrieve data from flow control server 
30 134 by sending a flow request 132 over link 112. Flow request 
132 includes data request 13 6 that device 110 wishes to send 

DAL01 : 489003 .3 



updated dynamically. 



For example, if a particular product 



06S241.0105 



ATTORNEY'S DOCKET 
066241.0105 



30 



PATENT APPLICATION 



to resource 120. Specifically, device 110 will generate flow 
request 132 with data request 134 therein. 

Flow control server 134 receives flow request 132 and 
determines whether to grant or deny permission to device 110 
5 to request data from resource 12 0. Flow control server 134 
then determines the current processing load at resource 12 0 
and determines if the current processing load at resource 12 0 
exceeds load criteria 106 for resource 120. 

When the current processing load at resource 12 0 is below 

10 load criteria 106 , then server 134 grants permission to device 
110 and device 110 retrieves the desired data from resource 
120. When the current processing load at resource 120 exceeds 
load criteria 106, server 134 may deny permission to device 
110 to retrieve the desired data from resource 120. 

15 Load monitor 104 communicates with resource 12 0 over link 

122 to maintain current load information about resource 120. 
Any suitable load metric and combinations of load metrics for 
representing the current resource and processing load on 
resource 12 0 may be used. For example, the number of 

2 0 simultaneous connections currently being supported by resource 
12 0 may be monitored. For another example, load monitor 104 
may interrogate resource 12 0 to determine the current load, 
such as by retrieving one or more Uniform Resource Locators 
(URLs) from resource 12 0 and observing the response of 

25 resource 120. For yet another example, resource 120 may 
further comprise a monitoring application 140 for collecting 
suitable load information about resource 120 and communicating 
the load information to load monitor 104 over link 122 . For 
example, application 140 may monitor the CPU load on resource 

30 120. 

Server 134 then associates a priority value with data 
request 136 based on priority criteria 102 associated with 
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resource 120. In trie disclosed embodiment , trie priority value 

comprises an integer value between 1 and 10, but may comprise 
any suitable value having order. Server 134 then stores the 
prioritized data request 13 6 in queue 100 based on the 
priority value. Server 134 then removes the highest priority 
request currently in queue 100 and returns this request to 
device 110 with permission to send this request to resource 
12 0 and retrieve the associated data when resources at 
resource 120 are available to process the request. 

When prioritized request 136 is inserted into queue 100, 
prioritized request 13 6 is inserted into queue 100 based on 
the associated priority value. Specifically, queue 100 is 
sorted first on the priority value associated with each 

request in queue 100. Prioritized requests with equal 

priority values are ordered in a first-in, first-out order. 
Alternatively, other suitable arrangements for ordering 
prioritized requests in queue 100 may be used. 

In addition, when server 134 denies permission to device 

110 to retrieve information from resource 120, server 134 may 

specify that device 110 return a status page 13 8 or other 
alternate content to an associated user (not shown) who 
generated request 136 based on priority criteria 102. Status 
page 138 comprises any suitable alternate content returned as 
a function of priority criteria 102 and load criteria 104. In 
the disclosed embodiment, status page 138 comprises an HTML 
page. Specifically, server 134 provides a Uniform Resource 
Locator (URL) to device 110 which specifies the web page 
device 110 should retrieve as status page 13 8 in place of the 
web page requested by data request 136. For example, status 
page 13 8 may comprise a web page dynamically generated by a 
web server other than resource 12 0 based on the URL provided 

by server 134. Typically, the specified status page 138 will 
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include an automatic resubmission time, such as ten seconds, 
for re-issuing request 136. Alternatively, any suitable type 
of alternate content may be returned to the associated user 
instead of the content requested by request 136. One 

5 advantage of returning status page 13 8, instead of simply 
waiting for* a timeout or* returning "server too busy", to the 

user is that the user can receive a more useful and user- 

friendly response than "server too busy." Also, when a 
"server too busy" response is returned, in traditional 

10 systems, the original request is lost. Device 110 would then 
need to resubmit the request to have the request processed. If 
the request was to buy an item, and device 110 does not 
resubmit the request, the sale would be lost. In contrast, 
server 134 stores the request for later processing and the 

15 sale would not be lost. Also, by allowing the HTTP connection 
to close after receiving the status page, resources at both 

the client requesting the content and any servers which the 

client has contacted are released for other uses. 

Priority criteria 102 may also specify temporary 

2 0 alternate content, such as status page 13 8, or replace the 

requested content with replacement alternate content 139 based 
on the communication bandwidth available to the user who 
generated request 136. Specifically, priority criteria 102 
may specify temporary alternate content, such as status page 
25 13 8, or replacement alternate content 13 9 for a particular 
request 136 for each of a high-bandwidth connection, a medium- 

bandwidth connection and a low-bandwidth connection. The 

definition of high, medium and low are configurable by an 
administrator associated with resource 120. For example, the 

3 0 administrator of a streaming media web site may define "high" 

bandwidth differently from the administrator of a text -based 

news site. For example, in the disclosed embodiment, request 
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13 6 may request ht tp : / /www . domain . com/ stuff . j pg and priority 
criteria 102 may return stuff 1 . j pg for a low -bandwidth 
connection, stuff2.jpg for a medium-bandwidth connection and 
stuff3.jpg for a high-bandwidth connection as replacement 
alternate content 13 9. For another example, in operation, 
request 136 is queued in queue 100 by flow control server 134 

and the appropriate status page 138 is returned. Continuing 
the example, request 13 6 is then released from queue 10 0 and 
priority criteria 102 is used to determine that request 136 
should have high-bandwidth alternate content 13 9 returned 
instead of the content requested by request 136. Stated 

another way, status pages 13 8 and alternate content 13 9 are 
not mutually exclusive since request 136 could have a 
particular status page 138 returned and then have alternate 
content 13 9 returned instead of the requested content. 

Further, either, both or neither of the particular status page 
138 and alternate content 139 may be determined based on the 
bandwidth available to the user who originated request 136. 
Alternatively, a larger or smaller set of connection types may 

be defined by priority criteria 102, For example, a 

administrator associated with resource 12 0 could specify five 
different types of connections based on differing amounts of 
available bandwidth. 

Status page 13 8 may also be delayed before being 
retrieved. If the originally requested content becomes 

available before the delivery of status page 138, then the 

originally requested content will be returned instead of 
status page 138. For example, an administrator associated 
with resource 12 0 may know that a 1 second delay is common for 
resource 120 and specify that status page 138 not be delivered 
for 1.5 seconds to account for the expected 1 second delay. 
The administrator can use the delay functionality to control 
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the return of status pages 13 8 only when resource 12 0 is 
experiencing delays longer than an expected delay. 

An example illustrating status page 13 8 and flow control 
server 134 is provided for increased clarity. Device 110 
generates data request 136 in response to a desire for content 
from a user associated with device 110. Device 110 then 
generates flow request 132, includes data request 136 therein, 
and sends flow request 132 to server 134. Server 134 

determines the current load on resource 12 0 and, for the 
purposes of this example, determines that resource 12 0 is 

currently too heavily loaded to currently handle request 136. 
Thus, server 134 prioritizes request 136 and stores request 
136 in queue 100. Server 134 also consults priority criteria 
102 for resource 12 0 and determines that status page 13 8 
should be sent to the user. Priority criteria 102 for 
resource 12 0 specifies that a URL for a web page that will 

list the position in queue 10 0 of request 136 and be 
automatically re-submitted in ten seconds should be returned, 
such as : http : / / www. domain . com/ status__page . asp?queue_position= 
10resubmit=10 . Server 134 then responds to flow request 132 

and informs device 110 to retrieve the URL. Device 110 then 
retrieves status page 138. Note that flow control server 134 

may have knowledge of the expected time until resource 120 
handles the request and may set the automatic resubmission 
time to be slightly longer than the time resource 12 0 will 
take to process request 136. Specifically, server 134 knows 
that resource 12 0 will take 9 seconds to process request 13 6 
so server 134 sets the automatic resubmission time to 10 

seconds so that when request 136 is re- submitted to resource 
120, the response is already available and the user sees a 
very fast response time. 
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In an alternative embodiment f the functionality of flow 
control server 134 is used to provide content filtering. In 

this embodiment, protected resource 120 comprises the Internet 

and flow controlled device 110 comprises a computer attempting 
5 to access the Internet. Load criteria 106 specifies that all 

or substantially all data requests 135 should be stored in 
queue 100 before being passed to the Internet. Priority 
criteria 102 specifies that status pages 138 should be sent 
based on the type of data requested by data request 13 6. 

10 Specifically, priority criteria 102 specifies that status page 
13 8 should be returned when data request 13 6 requests 
restricted or illegal data. For example, in a country where 
certain types of data, such as images involving nudity, are 
illegal, priority criteria 102 may specify that a "Materials 

15 Illegal" status page 13 8 be returned when illegal data is 
requested and that the queued request 136 for illegal data be 
discarded without being passed on to the Internet. For 

another example, priority criteria 102 may specify a legal 
version of the requested illegal data that is to be returned 

20 instead of the illegal data. Priority criteria 102 may 
evaluate whether the requested data is illegal data based on 
the geographic origin of request 13 6, external information 
associated with a user who generated request 136, the data 
requested by request 13 6 and any other suitable criteria. 

25 In the disclosed embodiment of FIGURES 1 and 3, device 

110 comprises cache server 32, server 134 comprises server 34, 
resource 12 0 comprises origin server 18 and data request 13 6 
comprises request 22. Also in the disclosed embodiment, the 

Flow Control Protocol (FCP) (described below) is used to 

3 0 support the flow control functionality described above, such 
as flow request 132. 
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The Flow Control Protocol (FCP) is used to exchange 
messages between the flow controlled device and the flow 

control server. The FCP may also be used for communication 
among any number of flow control servers 134 . The FCP is 
5 described here in terms of the World Wide Web (WWW) , but could 
be used in any suitable environment for controlling data flow 
between a flow controlled device and a protected resource. 

FCP Overview 

10 When a flow controlled device receives a request, it 

sends an initial request message to the flow control server to 
gain permission to send the request to the origin server. The 
flow control server responds with an initial response message. 

This message indicates go (send) or no go (do not send) . If 

15 the response is go, the flow controlled device is allowed to 
send the request to the protected resource. 

If the message indicates no go, the flow controlled 
device must wait until the flow control server sends it a 

request to send message. This messages gives the flow 
20 controlled device permission to send the request to the 
protected resource . 

The flow controlled device responds to the subsequent 
request by returning a. response status message to the flow 

control server indicating how many requests were received, how 
2 5 many were processed, or if an error occurred. 

In the event that a flow controlled device quits and 

another flow controlled device receives the request ID 
intended for the original device, the new device must send an 
ownership transfer message to the flow control server so it 
30 can n own" the request. The flow control server responds to the 
flow controlled device with an ownership assignment message. 
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If the flow control server has no record of the request, it 
must broadcast an ownership query message to the remaining 
flow control servers to see if the request resides on any of 
them. The flow control servers return an ownership query 
response message indicating the request. 

Initial Request Message 

When a flow controlled device receives a request from a 
browser, it sends an initial request message to the flow 
control server to gain permission to send the request to the 
origin server. The initial request message uses the following 
syntax: 

[xxxx] [xxxxxxxxxxxxxxxxxxxxx] Request 



TABLE 1 defines the values in the message syntax. 



Field 


Meaning 


xxxx 


The length of the entire message. 

The maximum length of the message 
is 4127 bytes, which is composed of 
a maximum message length of 4092, 

plus the length and ID fields. 
Valid integers for the length are 
0032 - 4127. If the length is fewer 

than four digits, use zeros for 
padding . 


xxxxxxxxxxxxxxxxxxxxxx 


The Globally Unique Identifier 
(GUID) . This identifier is uniquely 
associated with the request. Each x 
represents a character. The GUID 
length is 22 characters. 


Request 


The entire http request . The 
request cannot be more than 4096 
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bytes (4 K) . 

TABLE 1 

An example of an initial request is shown below. The 
request portion is only intended as a representation and is 
not a full request. 

[L0077] [127.000.000.001- 

Of b450] GET/http : //xf iles . domain . com/test/d . html 
Note: There are no spaces between the brackets. 

Initial Response Message 

The flow control server bases its response to the device 
upon various criteria, such as the load on the origin server. 

If the number of requests to trie protected resource surpasses 
its designated maximum load, the flow control server will not 
allow the flow controlled device to send the request. The flow 
control server sends an initial response message to the flow 
controlled device using the following syntax (note that "\t" 
indicates one tab stop between values) : 

GO | NOGO \ tPause \ tre fresh ra te \ ts ta tusURL \n 



TABLE 2 defines the values in the message syntax. 



Field 


Meaning 


GO | NOGO 


The response is either GO or NOGO. 
GO indicates the flow controlled 
device has permission to send the 
request to the origin server. NOGO 
indicates that the flow controlled 
device cannot send the request to 
the origin server until it receives 
a subsequent response message with 
a value of GO. 
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Pause 


A whole number indicating the 
length, in seconds, which to pause 
before sending a status page to the 
requesting browser. If the content 
is available before the time 
expires, a status page is not sent. 

Note: The pause value can be zero 
whether or not there is a status 

page URL. 


Refresh 


A whole number indicating the 
length, in seconds, which to wait 
before resubmitting the GUID to the 
flow controlled device. 

Note: The refresh value cannot be 
zero if there is a status page URL. 


StatusURL 


The URL of the status page to send 
to the requesting browser. If there 

"ipi no ?=! t~ ^ 1~ 1 1 s "DRCtf* TTRTi t~h^ valiiF* 
can be null . 

Note: If the value is null, the 
pause and refresh values must both 
be zero . 


\n 


Indicates line feed. 



TABLE 2 

Example initial responses are shown below. 



GQ\ 1 3 \ 1 7 \ t ht tp ; / /www . company . com/ s t atuspage . htm\n 

This message indicates that the flow controlled device can 
send the message to the origin server. The pause time of 3 
seconds indicates that if the request takes more than 3 
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seconds to return from the origin server, the flow controlled 
device is to send the indicated status page to the browser. 
The status page will resubmit the GUID to the flow controlled 

device in 7 seconds to check if content is available. If yes, 
5 the content is returned. If no, another status page is 
returned that will resubmit in 7 seconds. 

In the example below, the flow controlled device is instructed 

not to send the request. There are no other values. 

NOGO\t0\t0\t\n 
\Q 1 0 Request to Send Message 

/tj: If the initial response message is NOGO, the flow control 

server sends a request to send message when it is ready to 

i:0 give the flow controlled device permission to send the request 

to the origin server. The flow control server sends the 

m 15 message using the following syntax: 

S GO?GUIDl \nOUID2\nOUID3 \n\n 

jR TABLE 3 defines the values in the message syntax. 



Field 


Meaning 


GO 


GO indicates the flow controlled 
device has permission to send the 
request to the origin server. 


1-n GUIDs 


A subsequent request message can 
contain any number of GUIDs . 


\n 


Indicates line feed. Each GUID must 

be separated by \n. 


\n\n 


Indicates the end of the message. 



TABLE 3 

The example below shows a subsequent request message. 
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GO? 12 7 . 000.000. 001-0fb450\nl2 7 .000.000.001- 

Qbe350\nl27.QQQ.QQQ.QGl-QdaQ4Q\n\n 

Response Status Message 

When a flow controlled device receives a subsequent 

5 response message, it sends a response status message to the 
flow control server indicating how many requests were received 

and how many were processed. These numbers should be the same 
if no errors occurred. If an error occurs, the status message 
will either indicate an error or the two numbers will not be 
10 the same. 

Ownership Transfer Message 

When a flow controlled device sends a status page URL to 
a browser, the refresh value in the response message indicates 

the browser will resubmit the GUID to the device at defined 

15 intervals. When the GUID is resubmitted, it assumes that the 
request lives in the flow controlled device that originally 
serviced the request . 

In cases where the flow controlled device has stopped, 
the resubmitted GUID will go to another flow controlled 

20 device. However, the initial request does not reside on this 
device, so the flow controlled device is unable to associate 
the GUID with a request. The new device must gain ownership of 
the request by sending an ownership transfer message to the 
flow control server. 

2 5 Ownership Assignment Message 

In response to the ownership transfer message, the flow 
control server sends the flow controlled device an ownership 
assignment message. This message indicates the request that is 

associated with the GUID. 
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Ownership Query Message 

In cases where both the flow controlled device and the 

flow control server have stopped, the ownership transfer 
message will reach a flow control server that has no record of 
the request. The flow control server must broadcast an 
ownership query message to the remaining flow control servers 
to determine the request. 

Ownership Query Response Message 

Flow control servers that receive an ownership query 

message respond to the requesting server with an ownership 

query response message. This message notifies the querying 
flow control server of the request associated with a 

particular GUID. 

FIGURE 5 is a block diagram illustrating details of a 

cache server system 200. FIGURE 5 illustrates a general cache 

server for increased clarity and the application of the 
general cache server to cache server system 3 2 will be shown. 
FIGURE 5 is discussed in terms of the Internet Protocol (IP) , 
however, any suitable network protocol, such as ATM, frame 
relay and others, may be used. 

Cache server system 200 comprises a cache server portion 
202 and a dispatch server portion 204. Cache server system 
200 is coupled to a web client 206 via a communications link 
208 and to an origin server 210 via a communications link 212. 
Web client 206 is coupled to a DNS server 214 via a 

communications link 216. Origin server 210 is further coupled 

to a data source 218. 

Web client 2 06 comprises any suitable HTTP client 
operable to communicate with system 200 over link 208. In 
the disclosed embodiment, web client 206 comprises a web 
browser such as Internet Explorer. Origin server 210 comprises 
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software executing on a computer for receiving and responding 
to one or- more data, requests, such as requests 22 and requests 
136. Origin server 210 may comprise a single computer 

executing software , a plurality of computers each executing 
5 software or may comprise hardware with similar functionality. 
In the disclosed embodiment, origin server 210 comprises a 
single computer executing an HTTP server which is remote from 
cache server system 200. Origin server 210 retrieves 

information from data source 218 in response to data requests, 
p 10 Similar to origin server 18, origin server 210 is operable to 

!~ retrieve static content and to generate new, dynamic content. 

■J2 For example, origin server 210 may generate a new web page 

ia r; using a common gateway interface (CGI) script, generate a new 

ff\ web page from the result of a structured query language (SQL) 

;IL 15 request and perform other suitable content generation 

51 functions. Origin server 210 may also be operable to 

jsg generate executable software, such as applications and 

applets, in response to requests for data. For example, 
origin server 210 may generate a Java applet in response to an 
20 appropriate data request. 

DNS server 214 comprises any suitable DNS server operable 

to communicate with client 2 06 over link 216 and provide DNS 
resolution services to client 206. Specifically, DNS server 
214 translates Internet domain names and machine names into 

2 5 the IP address associated that machine and domain. 

Data source 218 comprises magnetic storage devices, 
optical storage devices, persistent storage devices, transient 
storage devices, and any other suitable storage device or 
technology for storing data. For example, data source 218 may 

30 be a Relational Database Management System (RDBMS) or an 
Oracle database. Data source 218 may comprise a portion of 
origin server 210 or may be physically separate therefrom. 
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Data source 218 includes a trigger portion 220. Trigger 
portion 22 0 comprises an application program, applet, software 

within a database, or other suitable executable software 

associated with data source 218 and operable to detect a 
5 change in the data stored at data source 218. Trigger portion 
220 generates data change message 28. 

Data change message 2 8 comprises any suitable message 
operable to indicate a change in the data or any portion of 

the data stored at data source 218. In the disclosed 

10 embodiment, data change message 2 8 comprises an Internet Cache 
Synchronization Protocol (ICSP) verbose message (described 

below) . 

Cache server 2 02 comprises cache criteria 222 and a 
software program executing on a data processing system for 
15 storing and retrieving content in response to information 
requests from web client 206. Specifically, cache server 202 
operates to store content, such as web pages, previously 

requested by web client 206. In the disclosed embodiment , a 

substantial portion of the content on cache server 2 02 will 
20 have been originally retrieved from origin server 210. 
Substantially little of the information available on cache 
server 202 is independently generated by cache server 202. 
Alternatively, cache server 2 02 may generate all or any 

portion of the content available on cache server 202. 

25 Cache criteria 222 comprises criteria provided by an 

administrator associated origin server 210 for determining 
which content available from origin server 210 should be 
cached. Cache criteria 222 may comprise any suitable criteria 
used to allow cache server 202 to determine whether content 

3 0 should be cached. In the disclosed embodiment, cache criteria 
222 comprises one or more URLs specifying web pages that 
should be cached and web pages that should not be 
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cached. The URLs may be specified individually, 

such cis http://www.doraain.com/dont_cache_me. html, or with 

wildcards, such as http : //www. domain. com/dont_cache_directory/ 
*.html for multiple files and http : / /* . no__cache_servers . 
5 domain. com/* .html for multiple machines. Each origin server 
210 may have distinct cache criteria 222. For example, cache 
criteria 222 may specify that certain web pages that could be 
cached should not be cached, such as pages containing a user's 
credit card number. For another example, cache criteria 222 
10 may specify that certain web pages should not be cached 
because the nature of the web page makes caching useless, such 

as credit card transactions which are only valid for that 
single credit card transaction. 

Dispatch server 2 04 is coupled to data source 218 and 
15 cache server 202. Dispatch server 204 operates to expire 
content at cache server 202. Dispatch server 204 expires 

content at cache server 2 02 by sending data expiration 
commands 24 to cache server 202. 

Cache server 2 02 and dispatch server 2 04 may both be 

2 0 located on a single physical computer system or may be located 

on separate computer systems . Further, cache server 2 02 may 
comprise multiple computer systems each providing caching 
services. For example 7 cache server 2 02 may comprise a server 

farm with a router (not shown) dispatching requests to 
25 individual elements of cache server 2 02 for handling. 

Data expiration command 24 comprises any suitable message 
for expiring data at cache server 202. In the disclosed 
embodiment, data expiration command 24 comprises an ICSP terse 
message (described below) . 

3 0 The ICSP terse command may expire any of a single web 

page, a plurality of web pages at a single web site, a 
plurality of web pages at a plurality of web sites, a 
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plurality of sites within a single domain and one or more 
specific objects on a web page, such as an image. For 
example, the ICSP tease command may expire a single web page 
such as http://www.domain.com/web_jpage.html. The ICSP terse 
5 command may expire a plurality of web pages at a single web 
site such as http://www.domain.com/web__pages/* which would 
expire all web pages in the "web_pages" directory at the site 
www.domain.com. The ICSP may expire a plurality of pages at a 

plurality of web sites such as http://* .domain.com/web_pages/* 

10 which would expire all web pages in the "web_pages" directory 
of each site in "domain.com". The ICSP terse command may 

expire a plurality of web sites such as http://* .domain.com 

which would expire all web pages on all sites in "domain. com" . 
For another example, a single active server page (ASP) may 
15 result in many individual cached pages. A single ASP page may 
generate large numbers of individual cached pages because a 
single ASP page can dynamically create multiple different 
specific pages, such as http://www.domain.com/product 
^description. asp?category= &product= may specify an ASP 

20 page that generates many specific product description pages 

based on a specified category and product. To cache 
server 2 02 , product_descript ion . asp ? ca t ego ry= Byproduct =7 , 
product_description. asp?category=5&product=6 and 
product_description . asp?category=3&product=7 specify three 

25 different cached pages. By expiring 

product_description . asp?category=*&product=7 ALL categories of 
product seven are expired, while not effecting product six 
related pages even though all of the pages were generated from 
a single ASP page. 
30 Data links 208, 212 and 216 may comprise any suitable 

networking system or technology operable to support 
communication between DNS server 214, web client 2 06, cache 
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system 200 and origin server 210. In the disclosed 

embodiment, data links 208,212 and 216 each comprise a portion 

o£ the Internet . 

In operation, origin server 210 contracts with cache 
5 system 200 for caching services. In order to provide caching 
services, an administrator associated with origin server 210 
configures DNS server 206 or allows DNS server 206 to be 
configured to resolve one or more machine and domain names 
associated with origin server 210 to resolve to cache system 
i;a 10 200 instead of origin server 210. Stated another way, after 
^ contracting for service, domain names and machine names 

;^ associated with origin server 210 will translate into one or 

: S p more IP addresses associated with cache system 200. 

iTii 

SJ Web client 2 06 generates a data request for content from 

ii; 15 oriqin server 210. Web client 206 consults DNS server 214 

£P over link 216 to translate the machine and domain name 

% associated with origin server 210 into an IP address. Since 

;:: origin server 210 has contracted for caching services from 

cache system 2 00, the IP address returned for origin server 
20 210 is actually an IP address associated with cache system 

200, The IP address associated with cache system 200 may be 

an IP address identifying cache server 202, a particular cache 
server 2 02 out of a farm of cache servers 2 02 or a router 

associated with a farm of cache servers 202 (not shown) for 

25 routing requests to a specific cache server 2 02 out of the 
farm of cache servers 202. Web client 200 then communicates 
the data request to cache system 2 00 over link 208 based on 
the IP address returned from DNS server 214. 

Cache server 2 02 receives the data request from web 
30 client 206 and determines if the requested content is stored 

and current at cache server 202. If the requested content is 
stored and current at cache server 2 02 then cache server 2 02 
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will return the requested content to web client 206. If the 
requested content is either not stored or not current 
("expired") at cache server 2 02, cache server 2 02 will 
retrieve the requested the data from origin server 210. After 
5 retrieving the requested data from origin server 210, cache 
server 2 02 will determine whether the requested data is 
cacheable based on caching criteria 222 for origin server 18. 
If the requested data is cacheable, then cache server 2 02 
stores a copy of the requested data and returns a copy of the 

10 requested data to web client 2 OS. In the disclosed 

embodiment, cache server 2 02 is operable to ignore the "no 
cache" directive associated with the HTTP protocol. 

Trigger portion 220 of data source 218 generates data 

change message 2 8 indicating a change in the content or 
15 portions of content available from origin server 210 and 

stored in data source 218. Trigger portion 220 may also 

indicate a change in the data based on the time and date. For 
example, trigger portion 22 0 may know that certain content 
changes at periodic intervals at data source 218, such as when 

20 data source 218 is a file system and trigger portion 220 is a 
process monitoring the file system for changes , and generate 
data change message 2 8 for that content independently of 
detecting a change in the content. 

Dispatch server 204 generates data expiration command 24. 

25 For example, dispatch server 2 04 may generate data expiration 
command 24 in response to a change in the content at data 
source 218. Specifically, dispatch server 204 generates data 
expiration command 24 in response to data change message 28 
from data source 218. Dispatch server 204 may also generate 

3 0 data expiration command 24 in response to the elapsing of a 
predetermined time period. For example, if a previously 
received data change message 28 specified that the expiration 
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of the data at cache server 2 02 should not occur for a set 
period of time, then dispatch server 204 will wait that set 
period of time before expiring data at cache server 202 by- 
generating data expiration command 24. 
5 The Internet Cache Synchronization Protocol (ICSP) is 

used to exchange messages to synchronize cached content with 

updated content. ICSP uses two levels of messages. ICSP 
Level 1 messages are sent with the HTTP protocol. ICSP Level 2 
messages are sent via sockets on the same network level as, 
10 but independent of, the HTTP protocol. Both levels specify an 

ICSP request and an ICSP response. For both levels, an ICSP 

request typically specifies a list of URLs and the operations 
to perform on them, such as expiring them in the cache or 
removing them from a cache list. A request also includes a 
15 header specifying information about the protocol, domain, 

requesting agent IP and type, and time. 

For both levels, the request message can be either terse 
or verbose. Cache servers, such as cache server 2 02, listen 
for terse messages , which indicate the action should occur 
2 0 immediately. Dispatch servers, such as dispatch server 2 04, 
listen for verbose messages, which indicate a particular time 
for the action to occur and specify the interval at which the 
action should reoccur. 



Request Header Fields 



.Reqruest Header- Field 


Meaning 


Version 


Protocol version, expressed as 
ICSP/x.x 


Domain 


Domain to which the message 
applies. For example, domain.com 


Request ing-Agent- IP 


The requesting agent IP. This can 
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be omitted if you are already using 

HTTP. 


Requesting -Agent -Type 


The requesting agent type (ICSP 

Dispatcher or Client Gateway) 


Request -Time 


The time of the request in UTC 
format (milliseconds) . 



TABLE 4 

Request Header Syntax for ICSP Level 1 Messages 

The header is inserted into the HTTP header as shown in the 
following example. 



5 Version: ICSP/1.0\r\n 

Domain : domain . com \n 

Note that "\n" represents a new- line character and "\r" a 

carriage return. 

Request Header Syntax for ICSP Level 2 Messages 
10 Unlike ICSP Level 1 request headers, a verbose request header 
is not inserted into an HTTP header. However, the Level 2 
header must appear the same as an HTTP header, as shown in the 
following example: 

Version: ICSP/1 . 0\n 

15 Domain: domain.com \n\n 

For terse requests, no header is required. 

ICSP Verbose Request Message Body 

In a verbose request, the message body contains messages that 
specify: an action to take, the URL on which to take the 
2 0 action, the time and interval at which the action on the URL 
will occur and the priority of the action on the URL. 
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Verbose request message syntax: 

< Operation Code indicating action>\n 
URL upon which to take action\n 

UTC in milliseconds indicating when to take action\n 
5 Interval in milliseconds indicating when to repeat 

action\n 

Integer Indicating action pr±ority\n 
<Operation Code indicating end message> 

10 The only required elements of the verbose request are the 

operation codes and the URL. The remaining elements will use 
default values if not specified. Specifically, the default 
value for the UTC is 1, which represents that the action 
should be taken immediately, the interval is zero which 
15 represents no repetition and the priority is three out of a 
one to five scale. Because there are no labels for these 
elements, the order in which they are listed. is very 
important . Therefore the interval and priority may not be 
indicated without also indicating the UTC, because the 

20 interval's value would be interpreted as the UTC and the 

priority's value would be interpreted as the interval since 
the values are interpreted based on their position. However, 
elements may be omitted in reverse order of the syntax. For 
example, all elements except the priority may be specified, 

25 since priority is the last element positionally , or all 
elements except for the priority and interval, such these are 
the last two elements positionally. 

TABLE 5 describes the actions and their associated 
operations codes that may be performed in a message. The 

3 0 actions are expire, add a new URL, remove a URL, rename a URL, 

and load a URL. For example, adding, loading, renaming and 

removing URLs may be used to modify caching criteria 202. To 
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receive an immediate response on the status for any of the 

actions, the code for a response action and the operation code 
number, such as R0 or R2 should be used. The operation codes 
in TABLE 2 would be inserted at the <Operation Code ind.ica.ting 
action> and the <Operation Code indicating end message> 
sections of the verbose request message as described above. 



Operation Codes for Verbose Messages 



Verbose 

Operation code 


Meaning 


<S0> 


Expire URL * This request flushes an 
object associated with, a URL from ttie 
cache. The URL is cached again until the 
next time it is requested. 


<S1> 


Add. This request adds a URL to the list 
of cached URLs. This allows a message to 
perrorm setup operations normally 
carried out in the administrative 
modules . 


<S2> 


Remove. This request removes a URL from 
the list of cached URLs. This allows a 
message to perform setup operations 
normally carried out in the 
administrative modules. 


<S3> 


Rename . This request renames a URL and 
can be used when a file, such as a gif 

or HTML page, is renamed. 


<S4> 


Load. This request expires the URL, and 
then loads it into the cache before the 
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first request comes. This is useful for 
items such as the graphics, which appear 
on several pages and rarely change . The 

load could be issued so that the first 
user does not have to wait for all of 
the images to be cached. 


<RO> 


Respond 


to 


expiration request. 


<R1> 


Respond 


to 


request to add URL. 


<R2> 


Respond 


to 


request to remove URL . 


<R3> 


Respond 


to 


request to rename URL. 


<R4> 


Respond 


to 


request to load URL . 



TABLE 5 

Unlike send actions <S>, response actions <R> are synchronous. 
If in the message body there are no response actions, the 
entire message body is sent to dispatch server 2 04 at one 
5 time. However, if there are any response actions in the 
message body, that action must be sent to dispatch server 204 
and on to cache server 2 02 and responded to before the 
remaining actions can be sent to dispatch server 204. 
Therefore, response actions can slow verbose messages. 

10 

For example, a verbose request messages may comprise: 

<S0>\n 

http : //www. domain. com/ index. html ?param=value \n 
94000000000000\n 
15 100000\n 

l\n 
</S0> 
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URLs 

The second element of the verbose request, following the 

operation code, is the URL upon which the specified action 
will take place. The URL is specified as the base URL and, 
5 optionally, one or more parameters. The URL can specify the 
value of one or more parameters and may include parameters, 
where applicable, for both POST and GET HTTP messages. The URL 

length is preferably no more than about 4 kilobytes, but may 
be longer in an alternate embodiment. ICSP level one messages 
10 are required to use URL style encoding because the ICSP level 
one message uses HTTP. Since the ICSP level two message uses 
sockets the URL encoding format is not required. Various 
examples are provided below for increased clarity. In each of 
the following examples, the action is to expire <S0> (i.e. 

15 expire a URL) . 

To expire a specific page, include the page name in the URL: 
<S0> 

http : //www . domain. com/ foresite/ j obs /programmer . dml 
</S0> 

20 In this example, the action applies only to the specific page. 
For example, if the page contained an image named splash.gif 
and it too should be expired, then its URL of the image must 
also be specified in the message, as shown in the following 
example : 

25 <S0> 

http : //www. domain. com/ images/ splash.gif 
</so> 

To expire specific parameters with specific values include the 
specific parameters to expire. In this example, the action 
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applies to this page only when the parameter position equals 
programmer and the parameter type equals fulltime. 

<S0> 

http: //www, domain .com/£oresite/jobs?posit ion-programmer&type=f 

5 ulltime 
</S0> 

To expire specific pages, regardless of the value of the 

page's parameters, include the specific parameters to expire 
and indicate their value with a wildcard, represented by an 
10 asterisk (*) for multiple characters. Other suitable wildcards 
may also be used, such as a wildcard for a single character. 

<so> 

http : //www. domain. com/f ores it e/ jobs ?position=*&type= * 
</so> 

15 In this example, the action expires any page in the URL with 
values for position and type. 

To expire a page, regardless of parameter name, value, or 
quantity all parameters may be replaced by an asterisk (*) . 
<so> 

2 0 http : //www. domain. com/f oresite/ j obs?* 
</S0> 
Time 

The third element of a request message body is time. The 
time that an action executes is specified in Universal 
2 5 Coordinate Time (UTC) . Optionally, the time of execution may 

be specified as immediate; indicated by the value 1, and 

never, indicated by the value 999999999. 
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In the following example, a specific time is indicated to 
expire the URL and, thereafter, every 100 seconds. 

<S0> 

http : / /www, domain . com/ index . html ?param= value 

5 94000000000000 

100000 
<E> 

In the next example, the request indicates to expire the URL 
now and, thereafter, every 100 seconds. 

10 <S0> 

http : //www . domain . com/index . html ?param= value 
1 

100000 
<E> 

15 

Priority 

The fourth element of the request message body is 
priority. For example, one of five priorities (1-5) may be 
assigned to a request. In general, all but the highest (1) 

20 priorities are relative and only apply to URLs within the 
specified domain. Other priority systems may be used in 
alternate embodiments of the present invention. Requests of 
higher priorities are guaranteed to execute before requests of 
lower priorities. The execution order of URLs within the same 

25 priority is first in, first out. 

The only non-relative priority is the highest priority 
which, in this example, is 1. This priority means that the 
request may not be queued. It the request cannot be executed 
at the specified time, then it is not completed and generates 
3 0 an error. 
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In the following example, the request to remove a URL is 
given a priority level of 3 . 

<S2> 

http : / /www. domain, com/whatsnew. html 
5 85000000000000 
0 
3 

<E> 

10 ICSP Terse Request Message Body 

In a terse request, the message body contains a message that 
specifies : 

• An action to take 

• The URL on which to take the action 

15 In terse messages, the action is always expire. Because a 
terse request indicates the action should occur immediately, 
there is no need to specify time or a priority. 

Terse request message syntax: 

a?data tzype-daiza. 
2 0 where, 

a = the action 

? = delimiter 

data type = the data type 

data = the actual URL 

25 

Terse request message example: 

x?url- http : / /www, domain. com/ index. html ?param= value 
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The following sections describe each element of the terse 

request message. 

Operation Codes for Terse Messages 

TABLE 6 describes the actions that may be performed in a terse 
5 message . 



Operation 
code 


.De s crip t i on 


X 


Expire . This request flushes an URL from 
the cache. The URL is cached again the 
next time it is requested. 


n 


Start flow control. This starts the flow 
control of a base URL by a flow control 
server such as server 134 . 


f 


Stop flow control. This stops the flow 
control of a base URL by a flow control 

server such as server 134. 



TABLE 6 

ICSP Verbose Response Description 

This section provides a description of the ICSP verbose 
response including : 

10 • ICSP Response Header 

• ICSP Response Message Body 

Verbose Response Header 

The ICSP verbose response header contains a message response 
and response code, as shown in the following example. 

15 ICSP-Message :OK 

ICSP-Code : 100 
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ICSP Verbose Response Body 

The ICSP verbose response body contains a message code and 
corresponding message text, as shown in the following example. 

100: OK 

5 ICSP Terse Response Description 

The section provides a description of the ICSP terse response 
header. There is no message body associated with the response. 

Terse Response Header 

;!S , ; The ICSP terse response header contains the response code and 

!»3 10 corresponding message text, as shown in the following example. 

M 100: OK 

h * i Ports 

□ For ICSP Level 1 (HTTP) , both verbose and terse messages 

15 are listened for on port 9092, but another suitable port may 

^ alternatively be used. 

For ICSP Level 2 (sockets) , verbose messages are listened 
for on port 13 70, and terse messages are listened for on port 
1827, but another suitable port may alternatively be used. 

20 

FIGURE 6 is a flowchart illustrating the data caching and 
dynamic flow control method of system 10. The method begins 
at step 3 00 where client 11 generates request 22 for content 
using browser 12. Next, at step 3 02, the data center 14 
25 closest in network distance to client 11 is determined. In 
the disclosed embodiment, a DNS lookup is performed on request 
22 which resolves to the internet protocol (IP) address of the 
data center 14 which is closest to client 11 in terms of 
network distance considering either or both of latency and hop 

30 count, or any other suitable metric for measuring network 
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distance. Then, at step 3 04, data center 14 receives request 
22 and communicates request 22 to web server 30. 
Alternatively, request 22 is communicated to cache server 32 
if web server 3 0 is not being used or has been merged into 
cache server 32 . When cache server 32 and web server 3 0 are 

merged, cache server 22 provides all appropriate functionality 

described below as performed by web server 30. As web server 
3 0 may comprise multiple physical servers each executing web 
server software, data center 14 may receive request 22 and 
then communicate request 22 to web server 3 0 having the lowest 
processing load. The processing load indicates, for example, 
the number of requests 22 which web server 3 0 is currently 
handling. For example, if web server 3 0 comprises two 

physical computers running web server software, and the first 
computer is handling ten requests while the second computer is 
handling only five requests, then data center 14 would route 
the next incoming request to the second computer. Proceeding 
to step 3 06, the chosen web server 3 0 receives request 2 2 and 
begins processing the received request 22 . 

Next, at step 3 08, web server 3 0 determines whether 

browser 12 which initiated request 22 is allowed to access the 
requested content. Web server 30 may use any authentication 
system suitable for authenticating requests 22, such as HTTP 

requests. If browser 12 is not allowed to access the 

requested content then the NO branch of decisional step 3 08 is 
followed to step 310. At step 310, web server 30 returns an 

access denied response to browser 12 and the processing of 
request 22 ends. If the user is allowed to access the object 
then the YES branch of decisional step 308 is followed to step 
312 . 

At decisional step 312, web server 3 0 determines whether 
request 22 is encrypted. Request 22 may be encrypted in order 
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to prevent the interception of the information contained 
within request 22. For example, if browser 12 and web server 
3 0 are communicating using the Secure Sockets Layer (SSL) 
protocol then request 22 will be encrypted so that only 
5 browser 12 and web server 30 can read the contents of request 
22 . Common uses of SSL comprise transferring credit card 
information between browser 12 and web server 3 0 when 

performing electronic commerce. If request 22 is encrypted 
then the YES branch of decisional branch 112 is followed to 
10 step 314 where request 22 is decrypted. At step 314, request 
22 may be decrypted using any of a variety of methods and the 

method proceeds to step 216. If the request is not encrypted 

then the NO branch of decisional step 312 is followed to 
decisional step 316. After the request is decrypted at step 
15 314, the method proceeds to decisional step 316. 

Then 7 at decisional step 316, web server 3 0 determines 

whether request 22 is a status page resubmission. As 
described in more detail in step 348 , flow control server 34 
may prevent cache server 3 2 from immediately requesting 

20 locally unavailable content from origin server 18. When cache 
server 32 is instructed to delay requesting locally 
unavailable content from origin server 18 by flow control 
server 34 then cache server 32 may return a status page to the 
user associated with browser 12 which includes status 

25 information associated with the delayed processing of the 
request. The status page may include an automatic 

resubmission mechanism for updating of the status page. For 
example, if the status page indicates a length of time that 
processing of request 22 will be delayed, then the status page 

3 0 may be automatically resubmitted after the stated period of 
time in order to get an update on the processing of the 
request. If request 22 is a status page resubmission, then 
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the YES branch of decisional step 316 is followed to step 348. 
When request 22 is not a status page resubmission then the NO 
branch of decisional step 316 is followed to decisional step 
318 . 

5 At decisional step 318, cache server 32 determines 

whether request 22 is requesting a cacheable item of content. 

Cache server 32 is operable to examine the contents of request 

22 to determine whether the requested content item is 
cacheable. In the disclosed embodiment, the elements of the 
10 URL comprising request 22 are examined. Cache server 32 
determines whether a content item is cacheable based on 
various configurable criteria. In the disclosed embodiment, 
the criteria determines whether the requested item of content 

is cacheable based on portions of the uniform resource locator 
15 (URL) associated with request 22. For example, when the URL 
associated with request 22 includes an authorization code for 
a particular electronic commerce transaction then the criteria 

can specify that the content item retrieved in response to 

request 22 should not be cached because electronic commerce 
20 authorization numbers are valid for only that one instance. 
In general, any portion or portions of the URL associated with 
request 22 may be used to determine whether the content item 
retrieved in response to request 22 is cacheable. 

In addition, unneeded portions of a URL can be filtered 
2 5 to increase the amount of content that is cacheable and 
correspondingly decrease the load on origin server 18. For 
example, a URL comprising 

http : //www . domain . com/ e_commerce/ cgi -bin/ query? id=l&f light =3 2 
may represent a request for dynamically generated output from 
30 a CGI script for a user with an identifier (id) of 1 for a 
flight schedule for flight number 32. Cache server 32 can be 
configured to cache and respond with the output of the URL 
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while ignoring the "id=l" portion of the URL. For example, 
the identifier value may not impact the information presented. 
Future requests for flight schedules for flight number 32 can 
be handled by cache server 32 regardless of the value of the 
5 "id" portion of the URL. In contrast, traditional caching 
techniques do not ignore the "id" portion of the URL and pass 
each request for a flight schedule for flight number 32 to 
origin server 18 which causes an increased processing load at 
origin server 18. Thus, cache server 32 treats both requests: 

10 http: //www. domain.com/e_commerce/cgi -bin/query ?id=l&flight=32 

http: //www. domain. com/e_commerce/cgi -bin/query ?id-2&f light =3 2 
as requests for the same content, specifically, "flight=32" 

because cache server 22 has been configured to ignore the "id" 

portion. In contrast, traditional web caching techniques 
15 would pass both requests to origin server 18 for handling 
because the different "id" values could not be ignored. 

When cache server 32 determines that the requested 
content item is cacheable then the YES branch of decisional 
step 318 is followed to decisional step 320. When the cache 

20 server determines that the requested content item is not 
cacheable then the NO branch of decisional step 318 is 
followed to decisional step 324 . 

Next, at step 320, cache server 32 determines whether the 

requested item is available. If the requested item is 

25 available then the YES branch of decisional step 320 is 
followed to step 322 and the requested item is retrieved from 
cache server 3 2 and returned to browser 12. If the requested 
item is not available then the NO branch of decisional step 
320 is followed to decisional step 324. 
3 0 Proceeding to decisional step 324, cache server 32 

desires to request locally unavailable content from server 18 
and is requesting permission from flow control server 34 to 
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retrieve the locally unavailable content. Plow control server 
34 determines whether origin server 18 is at capacity. To 
determine whether origin server 18 is at capacity, flow 
control server 34 may determine the current processor and 
5 network load at origin server 18. In the disclosed 

embodiment, a load monitoring application (not shown) is 
provided from data center 14 to origin server 18 when origin 
server 18 contracts for service from data center 14 for 
monitoring the load at origin server 18 and communicating the 

10 load information to flow control server 34 over network 13. 
Flow control server 34 may also use other metrics and other 
methods for measuring the current load at origin server 18. 
If origin server 18 is not at capacity then the NO branch of 
decisional step 324 is followed to step 326. If origin server 

15 18 is at capacity then the YES branch of decisional step 324 

is followed to step 340. 

Then, at decisional step 32 6 , cache server 32 has 
received permission from flow control server 34 to request 

locally unavailable content from origin server 18 . Cache 
20 server 32 then determines whether request 22 was encrypted. 
If request 22 was encrypted then the YES branch of decisional 
step 326 is followed to step 328 where cache server 32 re- 
encrypts request 22 so that request 22 cannot be intercepted 
between cache server 32 and origin server 18. If the request 
25 was not encrypted then the NO branch of decisional step 326 is 
followed to step 33 0. Then, at step 33 0, request 22, whether 
it is encrypted or non-encrypted, is communicated to origin 
server 18. Next, at step 332, the content item requested by 
request 2 2 is returned from origin server 18 to cache server 

30 32. Proceeding to decisional step 334, cache server 32 
determines whether the content item retrieved from origin 
server 18 is cacheable. Cache server 32 may use criteria 
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similar to that used in step 318, or may use a different set 
of criteria. If cache server 32 determines that the retrieved 
item is cacheable then the YES branch of decisional step 3 34 
is followed to step 336 where cache server 32 stores a copy of 

5 the retrieved content item. If the item is not cacheable then 
the NO branch of decisional step 334 is followed to step 338. 
In step 33 8, regardless of whether the retrieved item is 
cacheable, the item is communicated to browser 12. 

At decisional step 340, flow control server 34 has 

10 determined that origin server 18 is at capacity and the 
processing of request 22 may be delayed. At step 340 flow 
control server 34 determines whether request 22 is requesting 
a prioritizable content item. Flow control server 34 

determines whether the requested content item is prioritizable 

15 based on various criteria. For example, at an electronic 
commerce web site, a request could be prioritized based on 

information previously gathered about the user associated with 
the request, such as how much the user has previously spent at 
the electronic commerce web site. The amount the user has 
2 0 previously spent could be used to give that user a high 
priority because of the user's repeat business, while a first 
time shopper could have a lower priority because the first 
time shopper has a lower chance of purchasing an item. 

Continuing the electronic commerce web site example, the type 

2 5 of request from the user could be considered, such as whether 
the request is for more information about an item or a buy 
request for an item. Higher priority could be attached to the 
request to buy an item that the request for further 
information. If the requested item is a prioritizable item 

30 then the YES branch of decisional step 340 is followed to step 
342 where a priority is associated with request 22 based upon 
the criteria and any of request 22, the data requested by 
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request 22 and external information associated with request 22 
in any suitable combination. The external information 

comprises information which is not part of request 22 itself, 

but has been associated with request 22 and the user which 

5 generated request 22. For example, the external information 
may comprise historical shopping, statistical or other 

information collected about the user which generated request 
22. Request 22 is evaluated against the criteria using 
request 22, the data requested by request 22 and external 
10 information associated with request 22 in order to determine 

the priority to be associated with request 22. The method 

then continues to step 346. If the requested item is not 
prioritizable then the NO branch of decisional step 340 is 
followed to step 344 where a default priority is assigned to 

15 the request. The method then continues to step 346. 

In the disclosed embodiment; the priority is a numeric 
value,. but any suitable value which has order may be used. 
For example, an equation could be used to weigh multiple 
factors in order to come up with a single priority value. The 

20 numeric values range from 1 to 100, inclusive, where higher 
values indicate higher priority. The default priority may be 
set to any value as desired by an administrator associated 
with origin server 18 and in the disclosed embodiment may be 
from 1 to 100, inclusive. When a significant portion of the 

25 content available at origin server 18 is prioritizable, the 
default priority may be set to a low value, such as 25, 
because only unimportant activities are non-prioritizable. 

When a significant portion of the content available at origin 
server 18 is non-prioritizable, the default priority may be 
30 set to a middle value, such as 50, and the prioritizable items 

given high and low priorities based on their relative 
importance. The middle value default priority may be used 
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because most requests are roughly equal in importance to the 
administrator associated with origin server 18 . Any suitable 
distribution, formula or other method of calculating 
priorities may be used with any suitable value for the default 
5 priority based on the desires of the administrator. Origin 
servers 18 may have each distinct priority policies associated 
therewith so that flow control server 34 can provide 
customized prioritizing service to each origin server 18. 

Next, at step 346, the prioritized request is placed in a 
10 queue. The queue is sorted based on the priority associated 
-=0 with the prioritized request so that the highest priority 

M requests are handled first . 

j |i Proceeding to decisional step 348, flow control server 34 

L ^ determines whether the delay before the prioritized request is 

;5 15 handled by origin server 18 exceeds a predetermined time 

limit. The time limit is configurable based on origin server 
'rz 18 . Specifically, since each data center 14 may be serving 

□ multiple distinct origin servers 18, each respective origin 

server 18 may have differing requirements for the status page 
20 and the time limit. Thus, flow control server 34 is operable 

to provide differing time limits and status pages based on the 
particular origin server request 22 was intended for. The 
status page is also configurable as a function of request 22 . 
If the delay is greater than the predetermined time limit then 
25 the YES branch of decisional step 348 is followed to 
decisional step 350. At decisional step 350, flow control 
server 3 4 determines whether the requested item should have a 
status page sent . 

The status page may comprise any suitable HTML data for 
30 use with browser 12. The location of the status page is 
indicated by flow control server 34 to provide information 
regarding the status of the processing of request 22 to the 
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user associated with browser 12. The status page may be 
located at, for example, cache server 32 or at other suitable 

locations. In situations where origin server 18 is incapable 
of handling the volume of requests for locally unavailable 
5 content, the status page may be generated. The status page 
may include an automatic resubmission element which causes 
browser 12 to communicate the status page back to web server 
3 0 after a predetermined period of time. The status page may 
be resubmitted in order to update the status page with the 
10 status of request 22 after the predetermined time has elapsed 
or to return the content originally requested if the 

originally requested, content is avail able. 

If the requested item should have a status page sent then 
the YES branch of decisional step 350 is followed to step 351 

15 where a status page is communicated to browser 12 to inform 
the user associated with browser 12 of the status of request 
22. If no status page should be sent then the NO branch of 
decisional step 350 is followed to decisional step 352. If 
the delay before the prioritized request is handled is below 

20 the predetermined time period then the NO branch of decisional 
step 348 is followed to step 352 . 

At decisional step 352, flow control server 34 determines 

whether origin server 18 is still at full processing capacity. 
If origin server 18 is not at full capacity then the NO branch 

25 of decisional step 352 is followed to step 326 where the 
highest priority request in the priority queue is communicated 
to origin server 18 for handling . If origin server 18 is at 
capacity, for example because origin server 18 is handling 
content requests previously communicated to origin server 18, 

3 0 then the YES branch of decisional step 352 is followed to step 
3 54. At step 3 54, flow control server 34 waits a 

predetermined period of time before returning to step 3 52 and 
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determining whether origin server 18 is still at capacity. 
For example, flow control server 3 4 may wait 10 0 milliseconds 
before determining whether origin server 18 has dropped below 
capacity. 

5 System 10 provides numerous capabilities for improving 

web server performance and reliability. System 10 provides 
the capability to dynamically expire content for fine grained 
control of whether cached data is valid or invalid. Changes 
at a data store, such as data source 20, can be quickly 
10 detected and the cached data from the data store expired so 
that out-of-date information is not provided to a user. An 
administrator associated with a web site, such as origin 

server 18, can now determine in real-time when cached content 

should be expired. In contrast, administrators have 
15 traditionally had to determine when content should be expired 
when the user initially retrieves the content from the web 
server because traditional expiration times were fixed and 
provided as part of the content . Because the administrator 
could not provide for unexpected changes in the content, such 
20 as a flight in a flight schedule being cancelled, 
administrators often used extremely short expiration times to 
ensure that current data is provided to users. The short 
expiration times cause unnecessary increased load on the 

servers at the web site because content could only be cached 

25 for short periods of time before being re-retrieved from the 
web site, even though the cached data might still be correct. 

Also, system 10 provides the ability to cache both static 
and dynamic content. By caching both static and dynamic 
content remotely from a web site, the processing and network 

30 load on the web site is decreased. The processing load is 
decreased because the web site no longer has to generate 
multiple instances of dynamic data. The network load is 
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decreased because many requests for content can be handled by 
a data center, such as data center 14, where the content is 
cached instead of using the web site, such as origin server 
18, to handle the requests. Caching of both static and 
5 dynamic data allows a web site to off-load the handling and 
providing of not only simple static content, but many items of 

both static and dynamic content. 

Further, system 10 provides the capability to cache 
portions of dynamic content and configurably determine which 
10 portions to cache based on instructions from an administrator 
associated with a web site. This capability greatly increases 
the type and variety of dynamic content which can be cached by 
system 10. This capability also increases the effectiveness 
of the caching performed by system 10 because system 10 can 

15 filter the requests. Filtering requests allows system 10 to 
determine that distinct requests with differing content 
requests are actually requesting already cached content. This 
decreases the load experienced by the web site. 

In addition, system 10 can control the load experienced 

20 by a web site, such as origin server 18, by prioritizing and 
controlling the flow of requests to the web site. An 
administrator associated with the web site can request that 
system 10 not allow the processing and network load to exceed 
certain thresholds. Thus, the administrator can now make 

25 various assumptions about the volume and frequency of requests 

that will be experienced by a server associated with the web 
site and plan accordingly. Also, by prioritizing the 

requests, the delay experienced by a user who is requesting 
content from a busy web site can be tailored by the 
3 0 administrator to provide appropriate service to the users of 
the administrator's web site. 
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Also, system 10 allows a particular site, such as origin 
server 18, to purchase sufficient bandwidth and processing 

capability to handle the normal expected load on the site, 
while system 10 handles the spikes in load on the site. This 
5 provides numerous cost savings to the site because the site 

does not need to purchase and maintain large amounts of 
processing capability and bandwidth that sit around unused 
except during the heaviest traffic. 

It should also be recognized that direct connections 
10 disclosed herein could be altered such that two disclosed 
components or elements would be coupled to one another through 
an intermediate device or devices without being directly 
connected, while still realizing the present invention. Other 

changes, substitutions and alterations are also possible 
15 without departing from the spirit and scope of the present 
invention, as defined by the following claims. 



DAL01: 489003. 3 
066241 . 0105 



ATTORNEY'S DOCKET 
066241,0105 



72 



PATENT APPLICATION 



WHAT IS CLAIMED IS: 

1. A method for data processing comprising: 
receiving a data request at a data center ; 

assigning a priority to the request; 
5 queuing the request as a function of the priority 

associated with the request; and 

retrieving the requested data from an origin server. 

2. The method for data processing according to Claim 1, 
10 wherein the data center comprises a cache server, a flow 

control server and a web server. 

3 . The method for data processing according to Claim 1 
further comprising: 

15 determining a load associated with the origin server; 

controlling at a flow control server retrieval of data 

from the origin server by a cache servers- 
granting permission from the flow control server to the 

cache server to retrieve data when the load associated with 

2 0 the origin server is below a threshold; and 

denying permission from the flow control server to the 
cache server to retrieve data when the load associated with 
the origin server is above the threshold. 

25 4. The method for data processing according to Claim 3, 

wherein the threshold is determined as a function of the load 
of the origin server. 

5. The method for data processing according to Claim 1, 

3 0 wherein assigning the priority to the request comprises 

determining the priority based on a state associated with the 

request . 
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6. The method for data processing according to Claim 1 

further comprising communicating alternate content to a remote 
computer associated with the request. 

5 7. The method for data processing according to Claim 6, 

wherein the alternate content comprises a status page and 
wherein the status page is communicated when the load at the 
origin server exceeds a predetermined threshold. 

10 8. The method for data processing according to Claim 6, 

wherein communicating the alternate content comprises: 
associating a queue delay time with the request; 
determining whether the queue delay time exceeds a 

threshold; 

15 generating the alternate content as a function of 

predetermined criteria associated with the origin server when 
the queue delay time exceeds the threshold; and 

returning the alternate content to the remote computer. 



20 9. The method for data processing according to Claim 8, 

wherein the data requested is generated at a client computer 
remote from the data center and wherein generating the 
alternate content comprises selecting the alternate content as 
a function of the bandwidth available to the client computer. 

25 

10. The method for data processing according to Claim 9, 

wherein generating the alternate content further comprises 

determining the amount of bandwidth available to the client 
computer, wherein the amount of bandwidth comprises one of 
3 0 high-bandwidth, medium-bandwidth and low-bandwidth. 
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11. The method for data processing according to Claim 8, 
wherein generating the alternate content further comprises 
generating the alternate content based on the queue delay time 
and the predetermined criteria. 

5 

12. The method for data processing according to Claim 8, 
wherein the predetermined criteria comprises information 
associated with the request. 

10 13. The method for data processing according to Claim 8, 

wherein the predetermined criteria comprises external 
information associated with a user associated with the 
request . 

15 14 . The method for data processing according to Claim 

13, wherein the external information comprises historical 
shopping information associated with the user associated with 
the request . 

2 0 15. The method for data processing according to Claim 6, 

wherein the alternate content comprises a status page and 
further including resubmitting the data request to the data 
center by a browser to update the status page. 

25 16. The method for data processing according to Claim 

15, wherein resubmitting the data request is performed 
automatically by the browser. 
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17. The method for data processing according to Claim 1, 
wherein assigning the priority to the request comprises 
determining whether the request is prioritizable, and wherein 
the priority is a first priority when the request is non- 

5 prioritizable and wherein the priority is a second priority 

when the request is prioritizable. 

18. The method for data processing according to Claim 
17, wherein the first priority is a default priority and 

10 wherein the second priority is determined as a function of the 
data requested by the data request . 

19. The method for data processing according to Claim 1 
further comprising determining the load on an origin server by 

15 comparing a load metric associated with the origin server to ,a 
predetermined threshold . 

20. The method for data processing according to Claim 

19, wherein the load metric comprises the number of requests 
2 0 being handled by the origin server. 

21. The method for data processing according to Claim 
19 , wherein the load metric comprises the number of network 
connections being supported by the origin server. 

25 

22. The method for data processing according to Claim 
19, wherein the load metric comprises the delay associated 
with retrieving a predetermined web page from the origin 

server. 

30 



DAL01:489003 .3 
066241 . 0105 



ATTORNEY'S DOCKET 

066241 . 0105 



76 



PATENT APPLICATION 



23. The method for data processing according to Claim 1, 
wherein the request is a first request and. wherein queuing the 
request comprises : 

storing the first request in a queue associated with the 
5 data center, the queue having therein a plurality of second 
requests distinct from the first request, each of the second 
requests having a respective priority associated therewith; 

sorting the queue as a function of the respective 
priority associated with the first request and each of the 
10 second requests; and 

wherein retrieving the requested data comprises: 
processing the highest priority request in the queue by 

the origin server. 

15 24. The method for data processing according to Claim 1, 

wherein the data center comprises a plurality of data centers 

and wherein receiving the request comprises: 

determining a network distance between a client computer 

and at least one of the data centers; 
20 determining the closest data center to the client 

computer; 

resolving a destination address associated with the 
request to the closest data center; and 

routing the request to the closest data center. 
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25. A system for dynamic flow control comprising: 

a cache server operable to receive a request for content 
and retrieve content from an origin server in response 
thereto; and 

5 a flow control server having an associated queue and 

coupled to the cache server, the flow control server operable 
to: 

assign a priority to the request; 

store the prioritized request in the queue as a 
10 function of the priority associated with the request; and 

regulate the retrieval of content from the origin 
server by the cache server. 

26. The system for dynamic flow control according to 
15 Claim 25, wherein the cache server is further operable to 

request permission from the flow control server to retrieve 
content from the origin server. 

27. The system for dynamic flow control according to 
20 Claim 25, wherein the flow control server is further operable 

to associate a priority with the request based on the content 
requested by the request and determine a processing load 
associated with the origin server. 

2 5 28. The system for dynamic flow control according to 

Claim 25, wherein the flow control server is further operable 
to associated a priority with the request based on external 
information associated with the request. 
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29. The system for dynamic flow control according to 

Claim 28, wherein the external information comprises 
historical information associated with a user associated with 
the request. 
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30. A method for dynamic flow control comprising: 
receiving a data request at a flow control server; 
associating a priority with the data request; 

storing the data request in a queue as a function of the 
5 priority associated with the data request; and 

retrieving data requested by the data request from a 
protected resource. 

31. The method for dynamic flow control according to 
10 Claim 3 0 further comprising determining a load at the 

protected resource, and wherein associating a priority with 

the data request comprises associating the priority with the 
data request when the load at the protected resource exceeds a 
predetermined threshold and wherein queuing the data request 
15 comprises queuing the data request when the load at the 
protected resource exceeds the predetermined threshold. 

32. The method for dynamic flow control according to 
Claim 3 1 further comprising indicating status information to 

20 be returned to a remote computer associated with the data 
request . 

3 3 . The method for dynamic flow control according to 
Claim 31 further comprising indicating alternate content to be 

2 5 returned to a remote computer associated with the data 

request . 

34. The method for dynamic flow control according to 
Claim 33 further comprising determining the alternate content 

3 0 as a function of the bandwidth associated with the remote 

computer. 
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35. The method for dynamic flow control according to 

Claim 32 , wherein the status information comprises a status 

web page and wherein indicating status information comprises 

selecting the status page based on priority criteria 
5 associated with the protected resource. 



36. The method for dynamic flow control according to 

Claim 35 further comprising resubmitting the data request 

after a predetermined time interval to the flow control 
10 server. 



37. The method for dynamic flow control according to 
Claim 36, wherein resubmitting the data request is performed 
automatically. 

15 

38. The method for dynamic flow control according to 
Claim 35 further comprising delaying communication of the 
status page indication for a predetermined time interval based 
on the priority criteria. 

20 

39. The method for dynamic flow control according to 
Claim 38, wherein delaying communication of the status page 
comprises delaying communication of the status page indication 

based on an expected delay associated with the protected 

25 resource. 



40. The method for dynamic flow control according to 
Claim 31, wherein receiving the data request comprises 
receiving the data request from a flow controlled device. 

30 
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41. The method for dynamic flow control according to 
Claim 31, wherein retrieving the data requested by the data 
request comprises: 

removing at least one prioritized data request from the 

5 queue when the load at the protected resource is below the 
predetermined threshold; 

communicating the prioritized data requests to the 

protected resource; 

retrieving the requested data from the protected 
10 resource; and 

communicating the requested data to a flow controlled 
device associated with the data request. 

42. The method for dynamic flow control according to 
15 Claim 41, wherein removing at least one prioritized data 

request comprises removing the highest priority data request. 

43. The method for dynamic flow control according to 
Claim 41 7 wherein associating the priority with the request 

2 0 comprises determining whether the request is prioritizable, 
and wherein the priority is a first priority when the request 
is non-prioritizable and wherein the priority is a second 
priority when the request is prioritizable. 

25 44 . The method for data processing according to Claim 

43, wherein the first priority is a default priority and 
wherein the second priority is determined as a function of the 
request . 
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45. A method for content filtering comprising: 

receiving a data request at a flow control server from a 
flow controlled device, the data request having associated 
requested data; 
5 evaluating the requested data against criteria; 

discarding the data request when the requested data is 
forbidden by the criteria; 

determining alternate data based on the criteria and the 
requested data when the requested data is forbidden by the 
10 criteria; and 

indicating the alternate data to the flow controlled 
device when the requested data is forbidden. 



46. The method of content filtering according to Claim 

15 45, wherein the criteria comprises the geographic origin of 
the data request . 



47 . The method of content filtering according to Claim 
45, wherein the criteria comprises the content of the 

20 requested data. 



25 



48. The method of content filtering according to Claim 
45, wherein the criteria comprises the geographic origin of 
the data request and the content of the requested data. 

49. The method of content filtering according to Claim 
45, wherein the alternate data comprises a content forbidden 
web page . 



30 50. The method of content filtering according to Claim 

45, wherein the alternate data comprises an allowable web 
page. 
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51. The method of content filtering according to Claim 
45, wherein indicating the alternate data comprises returning 
the alternate data to the flow controlled device. 

52 . The method of content filtering according to Claim 
45, wherein indicating the alternate data comprises returning 
a location of the alternate data to the flow controlled 
device . 
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METHOD AND APPARATUS FOR DYNAMIC DATA FLOW CONTROL 

ABSTRACT OF THE DISCLOSURE 

A data center (14) receives a request (22) for content 
5 from a browser (12) on a client (11) . The data center 
determines whether the requested content is available at the 
data center. The content is available when the content is 
both present at the data center and current . The content may 

be expired and marked as non-available in response to an 

10 expiration command (24) . When the requested content is 
available at the data center, the data center returns the 
requested content to the data center. When the requested 
content is locally unavailable at the data center, the 
requested content is retrieved from an origin server (18) . 

15 The retrieval of the content from the origin server may be 
delayed based on the processing load at the origin server. 

When retrieval of the content is delayed, the request is 

prioritized and placed in a queue for handling by the origin 
server based on the priority of the request. Also, when 
2 0 retrieval of the content is delayed, a status page may be 
communicated to the browser to inform a user of the delay and 
provide alternate content (13 9) and status information related 
to the request determined as a function of the request or the 
current state of the origin server. 

25 
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